Dataset statistics
| Number of variables | 27 |
|---|---|
| Number of observations | 396030 |
| Missing cells | 81592 |
| Missing cells (%) | 0.8% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 405.1 MiB |
| Average record size in memory | 1.0 KiB |
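The derived rows in this table follow from the raw counts. The short check below is only a sketch that re-derives them from the figures printed above; the variable names are illustrative and do not come from the profiling tool.

```python
# Figures copied from the "Dataset statistics" table above.
n_variables = 27
n_observations = 396030
missing_cells = 81592
total_size_mib = 405.1

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# 1 MiB = 1024 KiB, so this is the average per-row footprint in KiB.
avg_record_kib = total_size_mib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_kib:.1f} KiB")
```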
Variable types
| CAT | 14 |
|---|---|
| NUM | 12 |
| BOOL | 1 |
Warnings
| Warning | Type |
|---|---|
| emp_title has a high cardinality: 173104 distinct values | High cardinality |
| issue_d has a high cardinality: 115 distinct values | High cardinality |
| title has a high cardinality: 48817 distinct values | High cardinality |
| earliest_cr_line has a high cardinality: 684 distinct values | High cardinality |
| address has a high cardinality: 393700 distinct values | High cardinality |
| installment is highly correlated with loan_amnt | High correlation |
| loan_amnt is highly correlated with installment | High correlation |
| sub_grade is highly correlated with grade | High correlation |
| grade is highly correlated with sub_grade | High correlation |
| emp_title has 22930 (5.8%) missing values | Missing |
| emp_length has 18301 (4.6%) missing values | Missing |
| mort_acc has 37795 (9.5%) missing values | Missing |
| annual_inc is highly skewed (γ1 = 41.04272475) | Skewed |
| dti is highly skewed (γ1 = 431.0512254) | Skewed |
| address is uniformly distributed | Uniform |
| pub_rec has 338272 (85.4%) zeros | Zeros |
| mort_acc has 139777 (35.3%) zeros | Zeros |
| pub_rec_bankruptcies has 350380 (88.5%) zeros | Zeros |
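The warnings above rest on a few per-column quantities: distinct count, share of missing values, share of zeros, and skewness γ1. The sketch below shows one way to compute them in plain Python. The function name `column_flags` is hypothetical, and it uses the population skewness m3 / m2^(3/2) as an assumption; the profiling tool may use a bias-corrected estimator and its own thresholds, neither of which is stated in this report.

```python
import math

def column_flags(values):
    """Return the raw quantities behind the warning types for one column.

    `values` is a list in which None marks a missing entry.
    """
    n = len(values)
    present = [v for v in values if v is not None]
    flags = {
        "distinct": len(set(present)),
        "missing_pct": 100 * (n - len(present)) / n,
    }
    numeric = [v for v in present if isinstance(v, (int, float))]
    if numeric and len(numeric) == len(present):
        flags["zeros_pct"] = 100 * sum(1 for v in numeric if v == 0) / n
        mean = sum(numeric) / len(numeric)
        m2 = sum((v - mean) ** 2 for v in numeric) / len(numeric)
        m3 = sum((v - mean) ** 3 for v in numeric) / len(numeric)
        # Population skewness; undefined for a constant column.
        flags["skewness"] = m3 / m2 ** 1.5 if m2 > 0 else math.nan
    return flags

# Toy column: mostly zeros, one missing entry, one large value.
toy = [0, 0, 0, 1, None, 0, 0, 10, 0, 0]
print(column_flags(toy))
```

A column dominated by zeros with a few large values, like the toy list, produces both a high zero share and a positive skewness, which is the pattern flagged for variables such as pub_rec and annual_inc.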
Reproduction
| Analysis started | 2021-01-02 18:05:35.073646 |
|---|---|
| Analysis finished | 2021-01-02 18:07:07.865054 |
| Duration | 1 minute and 32.79 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
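The reported duration can be checked against the two timestamps. This is a small sketch using only the standard library; the format string is an assumption based on how the timestamps are printed above.

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table above.
fmt = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2021-01-02 18:05:35.073646", fmt)
finished = datetime.strptime("2021-01-02 18:07:07.865054", fmt)

elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)
print(f"{int(minutes)} minute(s) and {seconds:.2f} seconds")
```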
loan_amnt
Real number (ℝ≥0)
| Distinct | 1397 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14113.88809 |
|---|---|
| Minimum | 500 |
| Maximum | 40000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 500 |
|---|---|
| 5-th percentile | 3250 |
| Q1 | 8000 |
| median | 12000 |
| Q3 | 20000 |
| 95-th percentile | 30975 |
| Maximum | 40000 |
| Range | 39500 |
| Interquartile range (IQR) | 12000 |
Descriptive statistics
| Standard deviation | 8357.441341 |
|---|---|
| Coefficient of variation (CV) | 0.5921430926 |
| Kurtosis | -0.06259753499 |
| Mean | 14113.88809 |
| Median Absolute Deviation (MAD) | 5500 |
| Skewness | 0.7772854671 |
| Sum | 5589523100 |
| Variance | 69846825.77 |
| Monotonicity | Not monotonic |
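Several rows in the quantile and descriptive tables above are functions of the others. The sketch below re-derives them from the printed figures; it assumes the usual definitions (range = max − min, IQR = Q3 − Q1, CV = standard deviation / mean, variance = standard deviation squared), which are consistent with the reported values.

```python
import math

# Figures copied from the tables above (396030 observations, no missing values).
n = 396030
minimum, q1, q3, maximum = 500, 8000, 20000, 40000
mean, std = 14113.88809, 8357.441341
total, variance, cv = 5589523100, 69846825.77, 0.5921430926

value_range = maximum - minimum   # "Range"
iqr = q3 - q1                     # "Interquartile range (IQR)"

print(value_range, iqr)
print(total / n, "vs reported mean", mean)
print(std / mean, "vs reported CV", cv)
print(std ** 2, "vs reported variance", variance)

# The derived values agree with the report up to its printed precision.
assert math.isclose(total / n, mean, rel_tol=1e-8)
assert math.isclose(std / mean, cv, rel_tol=1e-8)
assert math.isclose(std ** 2, variance, rel_tol=1e-8)
```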
Common values
| Value | Count | Frequency (%) | |
| 10000 | 27668 | 7.0% | |
| 12000 | 21366 | 5.4% | |
| 15000 | 19903 | 5.0% | |
| 20000 | 18969 | 4.8% | |
| 35000 | 14576 | 3.7% | |
| 8000 | 13539 | 3.4% | |
| 6000 | 12734 | 3.2% | |
| 5000 | 12443 | 3.1% | |
| 16000 | 10129 | 2.6% | |
| 18000 | 9195 | 2.3% | |
| 25000 | 9067 | 2.3% | |
| 24000 | 8684 | 2.2% | |
| 30000 | 6860 | 1.7% | |
| 7000 | 6744 | 1.7% | |
| 14000 | 5963 | 1.5% | |
| 28000 | 5517 | 1.4% | |
| 9000 | 5491 | 1.4% | |
| 4000 | 5400 | 1.4% | |
| 21000 | 5090 | 1.3% | |
| 3000 | 4888 | 1.2% | |
| 13000 | 3377 | 0.9% | |
| 9600 | 3328 | 0.8% | |
| 7200 | 3219 | 0.8% | |
| 11000 | 3200 | 0.8% | |
| 2000 | 2647 | 0.7% | |
| Other values (1372) | 156033 | 39.4% |
Extreme values (10 smallest)
| Value | Count | Frequency (%) | |
| 500 | 4 | < 0.1% | |
| 700 | 1 | < 0.1% | |
| 725 | 1 | < 0.1% | |
| 750 | 1 | < 0.1% | |
| 800 | 1 | < 0.1% | |
| 900 | 1 | < 0.1% | |
| 950 | 1 | < 0.1% | |
| 1000 | 1448 | 0.4% | |
| 1025 | 4 | < 0.1% | |
| 1050 | 10 | < 0.1% |
Extreme values (10 largest)
| Value | Count | Frequency (%) | |
| 40000 | 180 | < 0.1% | |
| 39700 | 1 | < 0.1% | |
| 39600 | 1 | < 0.1% | |
| 39500 | 1 | < 0.1% | |
| 39475 | 1 | < 0.1% | |
| 39200 | 1 | < 0.1% | |
| 38825 | 1 | < 0.1% | |
| 38750 | 1 | < 0.1% | |
| 38475 | 1 | < 0.1% | |
| 38300 | 1 | < 0.1% |
term
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
Common values
| Value | Count | Frequency (%) | |
| 36 months | 302005 | 76.3% | |
| 60 months | 94025 | 23.7% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Most occurring characters
| Value | Count | Frequency (%) | |
| (space) | 792060 | 20.0% | |
| 6 | 396030 | 10.0% | |
| m | 396030 | 10.0% | |
| o | 396030 | 10.0% | |
| n | 396030 | 10.0% | |
| t | 396030 | 10.0% | |
| h | 396030 | 10.0% | |
| s | 396030 | 10.0% | |
| 3 | 302005 | 7.6% | |
| 0 | 94025 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 2376180 | 60.0% | |
| Space Separator | 792060 | 20.0% | |
| Decimal Number | 792060 | 20.0% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| (space) | 792060 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 6 | 396030 | 50.0% | |
| 3 | 302005 | 38.1% | |
| 0 | 94025 | 11.9% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| m | 396030 | 16.7% | |
| o | 396030 | 16.7% | |
| n | 396030 | 16.7% | |
| t | 396030 | 16.7% | |
| h | 396030 | 16.7% | |
| s | 396030 | 16.7% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 2376180 | 60.0% | |
| Common | 1584120 | 40.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| (space) | 792060 | 50.0% | |
| 6 | 396030 | 25.0% | |
| 3 | 302005 | 19.1% | |
| 0 | 94025 | 5.9% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| m | 396030 | 16.7% | |
| o | 396030 | 16.7% | |
| n | 396030 | 16.7% | |
| t | 396030 | 16.7% | |
| h | 396030 | 16.7% | |
| s | 396030 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 3960300 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| (space) | 792060 | 20.0% | |
| 6 | 396030 | 10.0% | |
| m | 396030 | 10.0% | |
| o | 396030 | 10.0% | |
| n | 396030 | 10.0% | |
| t | 396030 | 10.0% | |
| h | 396030 | 10.0% | |
| s | 396030 | 10.0% | |
| 3 | 302005 | 7.6% | |
| 0 | 94025 | 2.4% |
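The character tables for term can be reproduced from the two value counts alone. In the sketch below, the leading space in each string is an assumption: it is one placement that matches the reported length of 10 and the two spaces per value, but the report does not show where the extra space sits.

```python
from collections import Counter

# Value counts from the term frequency table above. The leading space is an
# assumption chosen so that each string has 10 characters and two spaces.
value_counts = {" 36 months": 302005, " 60 months": 94025}

char_counts = Counter()
for value, count in value_counts.items():
    for ch in value:
        char_counts[ch] += count

total_chars = sum(char_counts.values())
for ch, count in char_counts.most_common():
    print(repr(ch), count, f"{100 * count / total_chars:.1f}%")
```

Under that assumption the totals agree with the tables: 3960300 characters overall, of which 792060 are spaces.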
int_rate
Real number (ℝ≥0)
| Distinct | 566 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.63940005 |
|---|---|
| Minimum | 5.32 |
| Maximum | 30.99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 5.32 |
|---|---|
| 5-th percentile | 6.89 |
| Q1 | 10.49 |
| median | 13.33 |
| Q3 | 16.49 |
| 95-th percentile | 21.97 |
| Maximum | 30.99 |
| Range | 25.67 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 4.472157382 |
|---|---|
| Coefficient of variation (CV) | 0.3278851978 |
| Kurtosis | -0.1439465381 |
| Mean | 13.63940005 |
| Median Absolute Deviation (MAD) | 3.08 |
| Skewness | 0.420669472 |
| Sum | 5401611.6 |
| Variance | 20.00019165 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) | |
| 10.99 | 12411 | 3.1% | |
| 12.99 | 9632 | 2.4% | |
| 15.61 | 9350 | 2.4% | |
| 11.99 | 8582 | 2.2% | |
| 8.9 | 8019 | 2.0% | |
| 12.12 | 7358 | 1.9% | |
| 7.9 | 7332 | 1.9% | |
| 16.29 | 6632 | 1.7% | |
| 13.11 | 6580 | 1.7% | |
| 6.03 | 6291 | 1.6% | |
| 17.57 | 6212 | 1.6% | |
| 15.31 | 6110 | 1.5% | |
| 9.17 | 6108 | 1.5% | |
| 13.99 | 5722 | 1.4% | |
| 14.33 | 5670 | 1.4% | |
| 16.99 | 5644 | 1.4% | |
| 18.25 | 5253 | 1.3% | |
| 9.99 | 5248 | 1.3% | |
| 11.14 | 5240 | 1.3% | |
| 7.62 | 4839 | 1.2% | |
| 12.69 | 4787 | 1.2% | |
| 13.98 | 4586 | 1.2% | |
| 14.65 | 4325 | 1.1% | |
| 12.49 | 4207 | 1.1% | |
| 7.89 | 4193 | 1.1% | |
| Other values (541) | 235699 | 59.5% |
Extreme values (10 smallest)
| Value | Count | Frequency (%) | |
| 5.32 | 2440 | 0.6% | |
| 5.42 | 465 | 0.1% | |
| 5.79 | 333 | 0.1% | |
| 5.93 | 431 | 0.1% | |
| 5.99 | 278 | 0.1% | |
| 6 | 70 | < 0.1% | |
| 6.03 | 6291 | 1.6% | |
| 6.17 | 220 | 0.1% | |
| 6.24 | 1184 | 0.3% | |
| 6.39 | 656 | 0.2% |
Extreme values (10 largest)
| Value | Count | Frequency (%) | |
| 30.99 | 13 | < 0.1% | |
| 30.94 | 3 | < 0.1% | |
| 30.89 | 3 | < 0.1% | |
| 30.84 | 1 | < 0.1% | |
| 30.79 | 9 | < 0.1% | |
| 30.74 | 4 | < 0.1% | |
| 30.49 | 5 | < 0.1% | |
| 29.99 | 7 | < 0.1% | |
| 29.96 | 8 | < 0.1% | |
| 29.67 | 15 | < 0.1% |
installment
Real number (ℝ≥0)
| Distinct | 55706 |
|---|---|
| Distinct (%) | 14.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 431.849698 |
|---|---|
| Minimum | 16.08 |
| Maximum | 1533.81 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 16.08 |
|---|---|
| 5-th percentile | 109.51 |
| Q1 | 250.33 |
| median | 375.43 |
| Q3 | 567.3 |
| 95-th percentile | 925.6 |
| Maximum | 1533.81 |
| Range | 1517.73 |
| Interquartile range (IQR) | 316.97 |
Descriptive statistics
| Standard deviation | 250.7277895 |
|---|---|
| Coefficient of variation (CV) | 0.5805904014 |
| Kurtosis | 0.7838199213 |
| Mean | 431.849698 |
| Median Absolute Deviation (MAD) | 150.5 |
| Skewness | 0.9835981609 |
| Sum | 171025435.9 |
| Variance | 62864.42443 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) | |
| 327.34 | 968 | 0.2% | |
| 332.1 | 791 | 0.2% | |
| 491.01 | 736 | 0.2% | |
| 336.9 | 686 | 0.2% | |
| 392.81 | 683 | 0.2% | |
| 332.72 | 641 | 0.2% | |
| 337.47 | 624 | 0.2% | |
| 317.54 | 574 | 0.1% | |
| 654.68 | 556 | 0.1% | |
| 261.88 | 527 | 0.1% | |
| 196.41 | 525 | 0.1% | |
| 399.26 | 523 | 0.1% | |
| 498.15 | 523 | 0.1% | |
| 318.79 | 514 | 0.1% | |
| 163.67 | 500 | 0.1% | |
| 635.07 | 500 | 0.1% | |
| 381.04 | 491 | 0.1% | |
| 625.81 | 488 | 0.1% | |
| 304.36 | 484 | 0.1% | |
| 312.91 | 466 | 0.1% | |
| 328.06 | 462 | 0.1% | |
| 476.3 | 455 | 0.1% | |
| 348.18 | 455 | 0.1% | |
| 343.39 | 447 | 0.1% | |
| 398.52 | 447 | 0.1% | |
| Other values (55681) | 381964 | 96.4% |
Extreme values (10 smallest)
| Value | Count | Frequency (%) | |
| 16.08 | 1 | < 0.1% | |
| 16.25 | 1 | < 0.1% | |
| 16.31 | 1 | < 0.1% | |
| 16.47 | 1 | < 0.1% | |
| 19.87 | 1 | < 0.1% | |
| 20.22 | 1 | < 0.1% | |
| 21.25 | 1 | < 0.1% | |
| 21.62 | 1 | < 0.1% | |
| 21.99 | 1 | < 0.1% | |
| 22.24 | 1 | < 0.1% |
Extreme values (10 largest)
| Value | Count | Frequency (%) | |
| 1533.81 | 1 | < 0.1% | |
| 1527 | 1 | < 0.1% | |
| 1503.85 | 1 | < 0.1% | |
| 1479.49 | 1 | < 0.1% | |
| 1464.42 | 1 | < 0.1% | |
| 1458.25 | 1 | < 0.1% | |
| 1451.14 | 2 | < 0.1% | |
| 1451.12 | 2 | < 0.1% | |
| 1445.9 | 1 | < 0.1% | |
| 1443.76 | 1 | < 0.1% |
grade
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
Common values
| Value | Count | Frequency (%) | |
| B | 116018 | 29.3% | |
| C | 105987 | 26.8% | |
| A | 64187 | 16.2% | |
| D | 63524 | 16.0% | |
| E | 31488 | 8.0% | |
| F | 11772 | 3.0% | |
| G | 3054 | 0.8% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| B | 116018 | 29.3% | |
| C | 105987 | 26.8% | |
| A | 64187 | 16.2% | |
| D | 63524 | 16.0% | |
| E | 31488 | 8.0% | |
| F | 11772 | 3.0% | |
| G | 3054 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 396030 | 100.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| B | 116018 | 29.3% | |
| C | 105987 | 26.8% | |
| A | 64187 | 16.2% | |
| D | 63524 | 16.0% | |
| E | 31488 | 8.0% | |
| F | 11772 | 3.0% | |
| G | 3054 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 396030 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| B | 116018 | 29.3% | |
| C | 105987 | 26.8% | |
| A | 64187 | 16.2% | |
| D | 63524 | 16.0% | |
| E | 31488 | 8.0% | |
| F | 11772 | 3.0% | |
| G | 3054 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 396030 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| B | 116018 | 29.3% | |
| C | 105987 | 26.8% | |
| A | 64187 | 16.2% | |
| D | 63524 | 16.0% | |
| E | 31488 | 8.0% | |
| F | 11772 | 3.0% | |
| G | 3054 | 0.8% |
sub_grade
Categorical
| Distinct | 35 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
Common values
| Value | Count | Frequency (%) | |
| B3 | 26655 | 6.7% | |
| B4 | 25601 | 6.5% | |
| C1 | 23662 | 6.0% | |
| C2 | 22580 | 5.7% | |
| B2 | 22495 | 5.7% | |
| B5 | 22085 | 5.6% | |
| C3 | 21221 | 5.4% | |
| C4 | 20280 | 5.1% | |
| B1 | 19182 | 4.8% | |
| A5 | 18526 | 4.7% | |
| C5 | 18244 | 4.6% | |
| D1 | 15993 | 4.0% | |
| A4 | 15789 | 4.0% | |
| D2 | 13951 | 3.5% | |
| D3 | 12223 | 3.1% | |
| D4 | 11657 | 2.9% | |
| A3 | 10576 | 2.7% | |
| A1 | 9729 | 2.5% | |
| D5 | 9700 | 2.4% | |
| A2 | 9567 | 2.4% | |
| E1 | 7917 | 2.0% | |
| E2 | 7431 | 1.9% | |
| E3 | 6207 | 1.6% | |
| E4 | 5361 | 1.4% | |
| E5 | 4572 | 1.2% | |
| Other values (10) | 14826 | 3.7% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| B | 116018 | 14.6% | |
| C | 105987 | 13.4% | |
| 1 | 81077 | 10.2% | |
| 4 | 80849 | 10.2% | |
| 3 | 79720 | 10.1% | |
| 2 | 79544 | 10.0% | |
| 5 | 74840 | 9.4% | |
| A | 64187 | 8.1% | |
| D | 63524 | 8.0% | |
| E | 31488 | 4.0% | |
| F | 11772 | 1.5% | |
| G | 3054 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 396030 | 50.0% | |
| Decimal Number | 396030 | 50.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| B | 116018 | 29.3% | |
| C | 105987 | 26.8% | |
| A | 64187 | 16.2% | |
| D | 63524 | 16.0% | |
| E | 31488 | 8.0% | |
| F | 11772 | 3.0% | |
| G | 3054 | 0.8% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 81077 | 20.5% | |
| 4 | 80849 | 20.4% | |
| 3 | 79720 | 20.1% | |
| 2 | 79544 | 20.1% | |
| 5 | 74840 | 18.9% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 396030 | 50.0% | |
| Common | 396030 | 50.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| B | 116018 | 29.3% | |
| C | 105987 | 26.8% | |
| A | 64187 | 16.2% | |
| D | 63524 | 16.0% | |
| E | 31488 | 8.0% | |
| F | 11772 | 3.0% | |
| G | 3054 | 0.8% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1 | 81077 | 20.5% | |
| 4 | 80849 | 20.4% | |
| 3 | 79720 | 20.1% | |
| 2 | 79544 | 20.1% | |
| 5 | 74840 | 18.9% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 792060 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| B | 116018 | 14.6% | |
| C | 105987 | 13.4% | |
| 1 | 81077 | 10.2% | |
| 4 | 80849 | 10.2% | |
| 3 | 79720 | 10.1% | |
| 2 | 79544 | 10.0% | |
| 5 | 74840 | 9.4% | |
| A | 64187 | 8.1% | |
| D | 63524 | 8.0% | |
| E | 31488 | 4.0% | |
| F | 11772 | 1.5% | |
| G | 3054 | 0.4% |
emp_title
Categorical
| Distinct | 173104 |
|---|---|
| Distinct (%) | 46.4% |
| Missing | 22930 |
| Missing (%) | 5.8% |
| Memory size | 3.0 MiB |
Common values
| Value | Count | Frequency (%) | |
| Teacher | 4389 | 1.1% | |
| Manager | 4250 | 1.1% | |
| Registered Nurse | 1856 | 0.5% | |
| RN | 1846 | 0.5% | |
| Supervisor | 1830 | 0.5% | |
| Sales | 1638 | 0.4% | |
| Project Manager | 1505 | 0.4% | |
| Owner | 1410 | 0.4% | |
| Driver | 1339 | 0.3% | |
| Office Manager | 1218 | 0.3% | |
| manager | 1145 | 0.3% | |
| Director | 1089 | 0.3% | |
| General Manager | 1074 | 0.3% | |
| Engineer | 995 | 0.3% | |
| teacher | 962 | 0.2% | |
| driver | 882 | 0.2% | |
| Vice President | 857 | 0.2% | |
| Operations Manager | 763 | 0.2% | |
| Administrative Assistant | 756 | 0.2% | |
| Accountant | 748 | 0.2% | |
| President | 742 | 0.2% | |
| owner | 697 | 0.2% | |
| Account Manager | 692 | 0.2% | |
| Police Officer | 686 | 0.2% | |
| supervisor | 673 | 0.2% | |
| Other values (173079) | 339058 | 85.6% | |
| (Missing) | 22930 | 5.8% |
Unique
| Unique | 145247 |
|---|---|
| Unique (%) | 38.9% |
Length
| Max length | 78 |
|---|---|
| Median length | 15 |
| Mean length | 15.80017928 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 606206 | 9.7% | |
| (space) | 487836 | 7.8% | |
| a | 478311 | 7.6% | |
| r | 470449 | 7.5% | |
| n | 451062 | 7.2% | |
| i | 406094 | 6.5% | |
| t | 373457 | 6.0% | |
| o | 330975 | 5.3% | |
| s | 293945 | 4.7% | |
| c | 244175 | 3.9% | |
| l | 200056 | 3.2% | |
| u | 120035 | 1.9% | |
| g | 118715 | 1.9% | |
| S | 115480 | 1.8% | |
| d | 99732 | 1.6% | |
| p | 98951 | 1.6% | |
| m | 98004 | 1.6% | |
| C | 93703 | 1.5% | |
| h | 88247 | 1.4% | |
| A | 87717 | 1.4% | |
| M | 73150 | 1.2% | |
| y | 68679 | 1.1% | |
| f | 66356 | 1.1% | |
| v | 59477 | 1.0% | |
| P | 57688 | 0.9% | |
| Other values (100) | 668845 | 10.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 4770129 | 76.2% | |
| Uppercase Letter | 939905 | 15.0% | |
| Space Separator | 487839 | 7.8% | |
| Other Punctuation | 45687 | 0.7% | |
| Decimal Number | 6383 | 0.1% | |
| Dash Punctuation | 5541 | 0.1% | |
| Open Punctuation | 847 | < 0.1% | |
| Close Punctuation | 821 | < 0.1% | |
| Math Symbol | 114 | < 0.1% | |
| Control | 30 | < 0.1% | |
| Modifier Symbol | 18 | < 0.1% | |
| Currency Symbol | 11 | < 0.1% | |
| Other Symbol | 7 | < 0.1% | |
| Connector Punctuation | 6 | < 0.1% | |
| Other Number | 4 | < 0.1% | |
| Format | 2 | < 0.1% | |
| Final Punctuation | 1 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| S | 115480 | 12.3% | |
| C | 93703 | 10.0% | |
| A | 87717 | 9.3% | |
| M | 73150 | 7.8% | |
| P | 57688 | 6.1% | |
| T | 54371 | 5.8% | |
| E | 50241 | 5.3% | |
| I | 49183 | 5.2% | |
| R | 48438 | 5.2% | |
| D | 45784 | 4.9% | |
| O | 36452 | 3.9% | |
| L | 34160 | 3.6% | |
| N | 33825 | 3.6% | |
| B | 27017 | 2.9% | |
| F | 25122 | 2.7% | |
| H | 24954 | 2.7% | |
| G | 19674 | 2.1% | |
| U | 17798 | 1.9% | |
| W | 12962 | 1.4% | |
| V | 12455 | 1.3% | |
| K | 5450 | 0.6% | |
| J | 5067 | 0.5% | |
| Y | 4792 | 0.5% | |
| Q | 2728 | 0.3% | |
| X | 1001 | 0.1% | |
| Other values (5) | 693 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 606206 | 12.7% | |
| a | 478311 | 10.0% | |
| r | 470449 | 9.9% | |
| n | 451062 | 9.5% | |
| i | 406094 | 8.5% | |
| t | 373457 | 7.8% | |
| o | 330975 | 6.9% | |
| s | 293945 | 6.2% | |
| c | 244175 | 5.1% | |
| l | 200056 | 4.2% | |
| u | 120035 | 2.5% | |
| g | 118715 | 2.5% | |
| d | 99732 | 2.1% | |
| p | 98951 | 2.1% | |
| m | 98004 | 2.1% | |
| h | 88247 | 1.8% | |
| y | 68679 | 1.4% | |
| f | 66356 | 1.4% | |
| v | 59477 | 1.2% | |
| k | 31079 | 0.7% | |
| b | 23452 | 0.5% | |
| w | 22366 | 0.5% | |
| x | 9561 | 0.2% | |
| j | 5664 | 0.1% | |
| z | 3024 | 0.1% | |
| Other values (8) | 2057 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| (space) | 487836 | > 99.9% | |
| (other space separator) | 3 | < 0.1% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| . | 18744 | 41.0% | |
| , | 9790 | 21.4% | |
| / | 7921 | 17.3% | |
| & | 6352 | 13.9% | |
| ' | 2523 | 5.5% | |
| # | 140 | 0.3% | |
| ; | 45 | 0.1% | |
| : | 43 | 0.1% | |
| ! | 31 | 0.1% | |
| " | 28 | 0.1% | |
| \ | 26 | 0.1% | |
| @ | 18 | < 0.1% | |
| * | 16 | < 0.1% | |
| % | 4 | < 0.1% | |
| ? | 3 | < 0.1% | |
| ¡ | 2 | < 0.1% | |
| ¶ | 1 | < 0.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 5541 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 1428 | 22.4% | |
| 2 | 1303 | 20.4% | |
| 3 | 1002 | 15.7% | |
| 4 | 548 | 8.6% | |
| 0 | 435 | 6.8% | |
| 5 | 409 | 6.4% | |
| 6 | 385 | 6.0% | |
| 9 | 335 | 5.2% | |
| 7 | 321 | 5.0% | |
| 8 | 217 | 3.4% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 839 | 99.1% | |
| [ | 7 | 0.8% | |
| { | 1 | 0.1% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 816 | 99.4% | |
| ] | 4 | 0.5% | |
| } | 1 | 0.1% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| + | 92 | 80.7% | |
| | | 16 | 14.0% | |
| ~ | 4 | 3.5% | |
| ¬ | 1 | 0.9% | |
| < | 1 | 0.9% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 6 | 100.0% |
Most frequent Control characters
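| Value | Count | Frequency (%) | |
| (control character) | 8 | 26.7% | |
| (control character) | 7 | 23.3% | |
| (control character) | 3 | 10.0% | |
| (control character) | 2 | 6.7% | |
| (control character) | 2 | 6.7% | |
| (control character) | 2 | 6.7% | |
| (control character) | 2 | 6.7% | |
| (control character) | 1 | 3.3% | |
| (control character) | 1 | 3.3% | |
| (control character) | 1 | 3.3% | |
| (control character) | 1 | 3.3% |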
Most frequent Other Number characters
| Value | Count | Frequency (%) | |
| ² | 3 | 75.0% | |
| ³ | 1 | 25.0% |
Most frequent Currency Symbol characters
| Value | Count | Frequency (%) | |
| $ | 8 | 72.7% | |
| ¢ | 3 | 27.3% |
Most frequent Other Symbol characters
| Value | Count | Frequency (%) | |
| © | 7 | 100.0% |
Most frequent Final Punctuation characters
| Value | Count | Frequency (%) | |
| ’ | 1 | 100.0% |
Most frequent Format characters
| Value | Count | Frequency (%) | |
| (format character) | 1 | 50.0% | |
| (format character) | 1 | 50.0% |
Most frequent Modifier Symbol characters
| Value | Count | Frequency (%) | |
| ` | 18 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 5710034 | 91.3% | |
| Common | 547311 | 8.7% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 606206 | 10.6% | |
| a | 478311 | 8.4% | |
| r | 470449 | 8.2% | |
| n | 451062 | 7.9% | |
| i | 406094 | 7.1% | |
| t | 373457 | 6.5% | |
| o | 330975 | 5.8% | |
| s | 293945 | 5.1% | |
| c | 244175 | 4.3% | |
| l | 200056 | 3.5% | |
| u | 120035 | 2.1% | |
| g | 118715 | 2.1% | |
| S | 115480 | 2.0% | |
| d | 99732 | 1.7% | |
| p | 98951 | 1.7% | |
| m | 98004 | 1.7% | |
| C | 93703 | 1.6% | |
| h | 88247 | 1.5% | |
| A | 87717 | 1.5% | |
| M | 73150 | 1.3% | |
| y | 68679 | 1.2% | |
| f | 66356 | 1.2% | |
| v | 59477 | 1.0% | |
| P | 57688 | 1.0% | |
| T | 54371 | 1.0% | |
| Other values (38) | 554999 | 9.7% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| (space) | 487836 | 89.1% | |
| . | 18744 | 3.4% | |
| , | 9790 | 1.8% | |
| / | 7921 | 1.4% | |
| & | 6352 | 1.2% | |
| - | 5541 | 1.0% | |
| ' | 2523 | 0.5% | |
| 1 | 1428 | 0.3% | |
| 2 | 1303 | 0.2% | |
| 3 | 1002 | 0.2% | |
| ( | 839 | 0.2% | |
| ) | 816 | 0.1% | |
| 4 | 548 | 0.1% | |
| 0 | 435 | 0.1% | |
| 5 | 409 | 0.1% | |
| 6 | 385 | 0.1% | |
| 9 | 335 | 0.1% | |
| 7 | 321 | 0.1% | |
| 8 | 217 | < 0.1% | |
| # | 140 | < 0.1% | |
| + | 92 | < 0.1% | |
| ; | 45 | < 0.1% | |
| : | 43 | < 0.1% | |
| ! | 31 | < 0.1% | |
| " | 28 | < 0.1% | |
| Other values (37) | 187 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 6257240 | > 99.9% | |
| None | 103 | < 0.1% | |
| Punctuation | 2 | < 0.1% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 606206 | 9.7% | |
| (space) | 487836 | 7.8% | |
| a | 478311 | 7.6% | |
| r | 470449 | 7.5% | |
| n | 451062 | 7.2% | |
| i | 406094 | 6.5% | |
| t | 373457 | 6.0% | |
| o | 330975 | 5.3% | |
| s | 293945 | 4.7% | |
| c | 244175 | 3.9% | |
| l | 200056 | 3.2% | |
| u | 120035 | 1.9% | |
| g | 118715 | 1.9% | |
| S | 115480 | 1.8% | |
| d | 99732 | 1.6% | |
| p | 98951 | 1.6% | |
| m | 98004 | 1.6% | |
| C | 93703 | 1.5% | |
| h | 88247 | 1.4% | |
| A | 87717 | 1.4% | |
| M | 73150 | 1.2% | |
| y | 68679 | 1.1% | |
| f | 66356 | 1.1% | |
| v | 59477 | 1.0% | |
| P | 57688 | 0.9% | |
| Other values (68) | 668740 | 10.7% |
Most frequent None characters
| Value | Count | Frequency (%) | |
| Ã | 21 | 20.4% | |
| Â | 10 | 9.7% | |
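| â | 8 | 7.8% | |
| (non-printing character) | 8 | 7.8% | |
| (non-printing character) | 7 | 6.8% | |
| © | 7 | 6.8% | |
| é | 4 | 3.9% | |
| ² | 3 | 2.9% | |
| ¢ | 3 | 2.9% | |
| (non-printing character) | 3 | 2.9% | |
| (non-printing character) | 3 | 2.9% | |
| ¡ | 2 | 1.9% | |
| á | 2 | 1.9% | |
| Æ | 2 | 1.9% | |
| (non-printing character) | 2 | 1.9% | |
| (non-printing character) | 2 | 1.9% | |
| (non-printing character) | 2 | 1.9% | |
| ñ | 2 | 1.9% | |
| (non-printing character) | 1 | 1.0% | |
| ¶ | 1 | 1.0% | |
| (non-printing character) | 1 | 1.0% | |
| ¬ | 1 | 1.0% | |
| (non-printing character) | 1 | 1.0% | |
| (non-printing character) | 1 | 1.0% | |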
| í | 1 | 1.0% | |
| Other values (5) | 5 | 4.9% |
Most frequent Punctuation characters
| Value | Count | Frequency (%) | |
| ’ | 1 | 50.0% | |
| (non-printing character) | 1 | 50.0% |
emp_length
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 18301 |
| Missing (%) | 4.6% |
| Memory size | 3.0 MiB |
Common values
| Value | Count | Frequency (%) | |
| 10+ years | 126041 | 31.8% | |
| 2 years | 35827 | 9.0% | |
| < 1 year | 31725 | 8.0% | |
| 3 years | 31665 | 8.0% | |
| 5 years | 26495 | 6.7% | |
| 1 year | 25882 | 6.5% | |
| 4 years | 23952 | 6.0% | |
| 6 years | 20841 | 5.3% | |
| 7 years | 20819 | 5.3% | |
| 8 years | 19168 | 4.8% | |
| 9 years | 15314 | 3.9% | |
| (Missing) | 18301 | 4.6% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.466431836 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| (space) | 409454 | 13.8% | |
| a | 396030 | 13.4% | |
| y | 377729 | 12.8% | |
| e | 377729 | 12.8% | |
| r | 377729 | 12.8% | |
| s | 320122 | 10.8% | |
| 1 | 183648 | 6.2% | |
| 0 | 126041 | 4.3% | |
| + | 126041 | 4.3% | |
| n | 36602 | 1.2% | |
| 2 | 35827 | 1.2% | |
| < | 31725 | 1.1% | |
| 3 | 31665 | 1.1% | |
| 5 | 26495 | 0.9% | |
| 4 | 23952 | 0.8% | |
| 6 | 20841 | 0.7% | |
| 7 | 20819 | 0.7% | |
| 8 | 19168 | 0.6% | |
| 9 | 15314 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1885941 | 63.8% | |
| Decimal Number | 503770 | 17.0% | |
| Space Separator | 409454 | 13.8% | |
| Math Symbol | 157766 | 5.3% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 183648 | 36.5% | |
| 0 | 126041 | 25.0% | |
| 2 | 35827 | 7.1% | |
| 3 | 31665 | 6.3% | |
| 5 | 26495 | 5.3% | |
| 4 | 23952 | 4.8% | |
| 6 | 20841 | 4.1% | |
| 7 | 20819 | 4.1% | |
| 8 | 19168 | 3.8% | |
| 9 | 15314 | 3.0% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| + | 126041 | 79.9% | |
| < | 31725 | 20.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| (space) | 409454 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 396030 | 21.0% | |
| y | 377729 | 20.0% | |
| e | 377729 | 20.0% | |
| r | 377729 | 20.0% | |
| s | 320122 | 17.0% | |
| n | 36602 | 1.9% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1885941 | 63.8% | |
| Common | 1070990 | 36.2% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| (space) | 409454 | 38.2% | |
| 1 | 183648 | 17.1% | |
| 0 | 126041 | 11.8% | |
| + | 126041 | 11.8% | |
| 2 | 35827 | 3.3% | |
| < | 31725 | 3.0% | |
| 3 | 31665 | 3.0% | |
| 5 | 26495 | 2.5% | |
| 4 | 23952 | 2.2% | |
| 6 | 20841 | 1.9% | |
| 7 | 20819 | 1.9% | |
| 8 | 19168 | 1.8% | |
| 9 | 15314 | 1.4% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 396030 | 21.0% | |
| y | 377729 | 20.0% | |
| e | 377729 | 20.0% | |
| r | 377729 | 20.0% | |
| s | 320122 | 17.0% | |
| n | 36602 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2956931 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| (space) | 409454 | 13.8% | |
| a | 396030 | 13.4% | |
| y | 377729 | 12.8% | |
| e | 377729 | 12.8% | |
| r | 377729 | 12.8% | |
| s | 320122 | 10.8% | |
| 1 | 183648 | 6.2% | |
| 0 | 126041 | 4.3% | |
| + | 126041 | 4.3% | |
| n | 36602 | 1.2% | |
| 2 | 35827 | 1.2% | |
| < | 31725 | 1.1% | |
| 3 | 31665 | 1.1% | |
| 5 | 26495 | 0.9% | |
| 4 | 23952 | 0.8% | |
| 6 | 20841 | 0.7% | |
| 7 | 20819 | 0.7% | |
| 8 | 19168 | 0.6% | |
| 9 | 15314 | 0.5% |
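The length and character figures for this variable only add up if the 18301 missing entries are counted as the three-character string "nan" (minimum length 3, and 36602 occurrences of "n"). That reading is an inference from the numbers, not something the report states. The sketch below checks it against the frequency table.

```python
# Counts copied from the frequency table above. The "nan" key stands in for
# the 18301 missing entries (an assumption, see the note above).
value_counts = {
    "10+ years": 126041, "2 years": 35827, "< 1 year": 31725,
    "3 years": 31665, "5 years": 26495, "1 year": 25882,
    "4 years": 23952, "6 years": 20841, "7 years": 20819,
    "8 years": 19168, "9 years": 15314, "nan": 18301,
}

n_rows = sum(value_counts.values())
total_chars = sum(len(v) * c for v, c in value_counts.items())
n_count = sum(v.count("n") * c for v, c in value_counts.items())

print(n_rows)                # compare with the number of observations
print(total_chars)           # compare with the ASCII block total
print(total_chars / n_rows)  # compare with "Mean length"
print(n_count)               # compare with the count for the character "n"
```

If the assumption holds, the character-level tables for columns with missing values describe the stringified column, so letters such as "n" and "a" are slightly over-represented relative to the actual data.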
home_ownership
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
Common values
| Value | Count | Frequency (%) | |
| MORTGAGE | 198348 | 50.1% | |
| RENT | 159790 | 40.3% | |
| OWN | 37746 | 9.5% | |
| OTHER | 112 | < 0.1% | |
| NONE | 31 | < 0.1% | |
| ANY | 3 | < 0.1% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 5.908327652 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| G | 396696 | 17.0% | |
| E | 358281 | 15.3% | |
| R | 358250 | 15.3% | |
| T | 358250 | 15.3% | |
| O | 236237 | 10.1% | |
| A | 198351 | 8.5% | |
| M | 198348 | 8.5% | |
| N | 197601 | 8.4% | |
| W | 37746 | 1.6% | |
| H | 112 | < 0.1% | |
| Y | 3 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 2339875 | 100.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| G | 396696 | 17.0% | |
| E | 358281 | 15.3% | |
| R | 358250 | 15.3% | |
| T | 358250 | 15.3% | |
| O | 236237 | 10.1% | |
| A | 198351 | 8.5% | |
| M | 198348 | 8.5% | |
| N | 197601 | 8.4% | |
| W | 37746 | 1.6% | |
| H | 112 | < 0.1% | |
| Y | 3 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 2339875 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| G | 396696 | 17.0% | |
| E | 358281 | 15.3% | |
| R | 358250 | 15.3% | |
| T | 358250 | 15.3% | |
| O | 236237 | 10.1% | |
| A | 198351 | 8.5% | |
| M | 198348 | 8.5% | |
| N | 197601 | 8.4% | |
| W | 37746 | 1.6% | |
| H | 112 | < 0.1% | |
| Y | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2339875 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| G | 396696 | 17.0% | |
| E | 358281 | 15.3% | |
| R | 358250 | 15.3% | |
| T | 358250 | 15.3% | |
| O | 236237 | 10.1% | |
| A | 198351 | 8.5% | |
| M | 198348 | 8.5% | |
| N | 197601 | 8.4% | |
| W | 37746 | 1.6% | |
| H | 112 | < 0.1% | |
| Y | 3 | < 0.1% |
annual_inc
Real number (ℝ≥0)
| Distinct | 27197 |
|---|---|
| Distinct (%) | 6.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 74203.1758 |
|---|---|
| Minimum | 0 |
| Maximum | 8706582 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 28000 |
| Q1 | 45000 |
| median | 64000 |
| Q3 | 90000 |
| 95-th percentile | 150000 |
| Maximum | 8706582 |
| Range | 8706582 |
| Interquartile range (IQR) | 45000 |
Descriptive statistics
| Standard deviation | 61637.62116 |
|---|---|
| Coefficient of variation (CV) | 0.8306601503 |
| Kurtosis | 4238.550572 |
| Mean | 74203.1758 |
| Median Absolute Deviation (MAD) | 21000 |
| Skewness | 41.04272475 |
| Sum | 2.938668371e+10 |
| Variance | 3799196342 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) | |
| 60000 | 15313 | 3.9% | |
| 50000 | 13303 | 3.4% | |
| 65000 | 11333 | 2.9% | |
| 70000 | 10674 | 2.7% | |
| 40000 | 10629 | 2.7% | |
| 45000 | 10114 | 2.6% | |
| 80000 | 9971 | 2.5% | |
| 75000 | 9850 | 2.5% | |
| 55000 | 9195 | 2.3% | |
| 90000 | 7573 | 1.9% | |
| 100000 | 7480 | 1.9% | |
| 85000 | 6936 | 1.8% | |
| 35000 | 6544 | 1.7% | |
| 30000 | 6250 | 1.6% | |
| 120000 | 5767 | 1.5% | |
| 52000 | 5316 | 1.3% | |
| 42000 | 5296 | 1.3% | |
| 48000 | 5048 | 1.3% | |
| 110000 | 4870 | 1.2% | |
| 72000 | 4369 | 1.1% | |
| 95000 | 4100 | 1.0% | |
| 36000 | 3666 | 0.9% | |
| 150000 | 3472 | 0.9% | |
| 62000 | 3434 | 0.9% | |
| 38000 | 3204 | 0.8% | |
| Other values (27172) | 212323 | 53.6% |
Extreme values (10 smallest)
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 600 | 1 | < 0.1% | |
| 2500 | 1 | < 0.1% | |
| 4000 | 2 | < 0.1% | |
| 4080 | 1 | < 0.1% | |
| 4200 | 1 | < 0.1% | |
| 4524 | 1 | < 0.1% | |
| 4800 | 6 | < 0.1% | |
| 4888 | 1 | < 0.1% | |
| 5000 | 3 | < 0.1% |
Extreme values (10 largest)
| Value | Count | Frequency (%) | |
| 8706582 | 1 | < 0.1% | |
| 7600000 | 1 | < 0.1% | |
| 7446395 | 1 | < 0.1% | |
| 7141778 | 1 | < 0.1% | |
| 7000000 | 1 | < 0.1% | |
| 6500000 | 1 | < 0.1% | |
| 6100000 | 1 | < 0.1% | |
| 6000000 | 2 | < 0.1% | |
| 5000000 | 1 | < 0.1% | |
| 4900000 | 1 | < 0.1% |
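The derived figures in the annual_inc statistics (range, IQR, coefficient of variation, variance) follow from the other reported values. A minimal sketch that recomputes them, with the inputs copied by hand from the tables above (variable names are illustrative, not part of the report):

```python
# Inputs copied from the annual_inc summary tables in this report.
mean = 74203.1758
std = 61637.62116
q1, q3 = 45000, 90000
minimum, maximum = 0, 8706582

iqr = q3 - q1                    # interquartile range = Q3 - Q1
value_range = maximum - minimum  # range = max - min
cv = std / mean                  # coefficient of variation = std / mean
variance = std ** 2              # variance = std squared

print(f"IQR={iqr}  range={value_range}  CV={cv:.6f}  variance={variance:.0f}")
```

The 95th percentile (150000) sits far below the maximum (8706582), which is consistent with the very large skewness and kurtosis reported for this variable.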
verification_status
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
Common values
| Value | Count | Frequency (%) | |
| Verified | 139563 | 35.2% | |
| Source Verified | 131385 | 33.2% | |
| Not Verified | 125082 | 31.6% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 15 |
|---|---|
| Median length | 12 |
| Mean length | 11.58564503 |
| Min length | 8 |
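The length statistics can be cross-checked against the category counts, since each label has a fixed length. A small sketch under that assumption, using the three counts from the frequency table above:

```python
# Category counts copied from the verification_status frequency table.
counts = {
    "Verified": 139563,
    "Source Verified": 131385,
    "Not Verified": 125082,
}

total_rows = sum(counts.values())
# Each row contributes len(label) characters.
total_chars = sum(len(label) * n for label, n in counts.items())
mean_length = total_chars / total_rows

print(total_rows, total_chars, round(mean_length, 8))
```

The character total obtained this way should agree with the ASCII character count listed further down in this variable's character tables.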
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 923445 | 20.1% | |
| i | 792060 | 17.3% | |
| r | 527415 | 11.5% | |
| V | 396030 | 8.6% | |
| f | 396030 | 8.6% | |
| d | 396030 | 8.6% | |
| o | 256467 | 5.6% | |
| (space) | 256467 | 5.6% | |
| S | 131385 | 2.9% | |
| u | 131385 | 2.9% | |
| c | 131385 | 2.9% | |
| N | 125082 | 2.7% | |
| t | 125082 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 3679299 | 80.2% | |
| Uppercase Letter | 652497 | 14.2% | |
| Space Separator | 256467 | 5.6% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| V | 396030 | 60.7% | |
| S | 131385 | 20.1% | |
| N | 125082 | 19.2% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 923445 | 25.1% | |
| i | 792060 | 21.5% | |
| r | 527415 | 14.3% | |
| f | 396030 | 10.8% | |
| d | 396030 | 10.8% | |
| o | 256467 | 7.0% | |
| u | 131385 | 3.6% | |
| c | 131385 | 3.6% | |
| t | 125082 | 3.4% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| (space) | 256467 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 4331796 | 94.4% | |
| Common | 256467 | 5.6% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 923445 | 21.3% | |
| i | 792060 | 18.3% | |
| r | 527415 | 12.2% | |
| V | 396030 | 9.1% | |
| f | 396030 | 9.1% | |
| d | 396030 | 9.1% | |
| o | 256467 | 5.9% | |
| S | 131385 | 3.0% | |
| u | 131385 | 3.0% | |
| c | 131385 | 3.0% | |
| N | 125082 | 2.9% | |
| t | 125082 | 2.9% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| (space) | 256467 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 4588263 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 923445 | 20.1% | |
| i | 792060 | 17.3% | |
| r | 527415 | 11.5% | |
| V | 396030 | 8.6% | |
| f | 396030 | 8.6% | |
| d | 396030 | 8.6% | |
| o | 256467 | 5.6% | |
| (space) | 256467 | 5.6% | |
| S | 131385 | 2.9% | |
| u | 131385 | 2.9% | |
| c | 131385 | 2.9% | |
| N | 125082 | 2.7% | |
| t | 125082 | 2.7% |
issue_d
Categorical
| Distinct | 115 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
Common values
| Value | Count | Frequency (%) | |
| Oct-2014 | 14846 | 3.7% | |
| Jul-2014 | 12609 | 3.2% | |
| Jan-2015 | 11705 | 3.0% | |
| Dec-2013 | 10618 | 2.7% | |
| Nov-2013 | 10496 | 2.7% | |
| Jul-2015 | 10270 | 2.6% | |
| Oct-2013 | 10047 | 2.5% | |
| Jan-2014 | 9705 | 2.5% | |
| Apr-2015 | 9470 | 2.4% | |
| Sep-2013 | 9179 | 2.3% | |
| Aug-2013 | 9112 | 2.3% | |
| Apr-2014 | 9020 | 2.3% | |
| Nov-2014 | 8858 | 2.2% | |
| May-2014 | 8840 | 2.2% | |
| Jul-2013 | 8631 | 2.2% | |
| Oct-2015 | 8401 | 2.1% | |
| May-2015 | 8325 | 2.1% | |
| Mar-2014 | 8108 | 2.0% | |
| Jun-2013 | 7947 | 2.0% | |
| Aug-2014 | 7860 | 2.0% | |
| Feb-2014 | 7624 | 1.9% | |
| Jun-2014 | 7610 | 1.9% | |
| May-2013 | 7567 | 1.9% | |
| Mar-2015 | 7268 | 1.8% | |
| Feb-2015 | 7167 | 1.8% | |
| Other values (90) | 164747 | 41.6% |
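The values above are month-year strings of fixed length 8, which is why the report treats the column as a high-cardinality categorical. A minimal sketch of parsing them into dates with the standard library, assuming an English locale for the `%b` month abbreviation (`parse_month` is a hypothetical helper name):

```python
from datetime import datetime

def parse_month(label: str) -> datetime:
    """Parse a 'Mon-YYYY' label such as 'Oct-2014' (day defaults to 1)."""
    return datetime.strptime(label, "%b-%Y")

# The five most frequent labels from the table above.
labels = ["Oct-2014", "Jul-2014", "Jan-2015", "Dec-2013", "Nov-2013"]
parsed = sorted(parse_month(s) for s in labels)

for d in parsed:
    print(d.strftime("%Y-%m"))
```

Once parsed, the 115 distinct strings become an ordered time axis rather than unordered categories.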
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 2 | 437232 | 13.8% | |
| 0 | 410549 | 13.0% | |
| 1 | 408204 | 12.9% | |
| - | 396030 | 12.5% | |
| J | 104536 | 3.3% | |
| 4 | 102860 | 3.2% | |
| u | 102670 | 3.2% | |
| a | 98496 | 3.1% | |
| 3 | 97662 | 3.1% | |
| 5 | 94264 | 3.0% | |
| e | 85443 | 2.7% | |
| c | 71212 | 2.2% | |
| A | 66039 | 2.1% | |
| r | 65142 | 2.1% | |
| n | 64822 | 2.0% | |
| M | 63814 | 2.0% | |
| p | 60842 | 1.9% | |
| O | 42130 | 1.3% | |
| t | 42130 | 1.3% | |
| l | 39714 | 1.3% | |
| N | 34068 | 1.1% | |
| o | 34068 | 1.1% | |
| v | 34068 | 1.1% | |
| g | 32816 | 1.0% | |
| y | 31895 | 1.0% | |
| Other values (8) | 147534 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 1584120 | 50.0% | |
| Lowercase Letter | 792060 | 25.0% | |
| Uppercase Letter | 396030 | 12.5% | |
| Dash Punctuation | 396030 | 12.5% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| J | 104536 | 26.4% | |
| A | 66039 | 16.7% | |
| M | 63814 | 16.1% | |
| O | 42130 | 10.6% | |
| N | 34068 | 8.6% | |
| D | 29082 | 7.3% | |
| F | 28742 | 7.3% | |
| S | 27619 | 7.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| u | 102670 | 13.0% | |
| a | 98496 | 12.4% | |
| e | 85443 | 10.8% | |
| c | 71212 | 9.0% | |
| r | 65142 | 8.2% | |
| n | 64822 | 8.2% | |
| p | 60842 | 7.7% | |
| t | 42130 | 5.3% | |
| l | 39714 | 5.0% | |
| o | 34068 | 4.3% | |
| v | 34068 | 4.3% | |
| g | 32816 | 4.1% | |
| y | 31895 | 4.0% | |
| b | 28742 | 3.6% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 396030 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 437232 | 27.6% | |
| 0 | 410549 | 25.9% | |
| 1 | 408204 | 25.8% | |
| 4 | 102860 | 6.5% | |
| 3 | 97662 | 6.2% | |
| 5 | 94264 | 6.0% | |
| 6 | 28088 | 1.8% | |
| 9 | 3826 | 0.2% | |
| 8 | 1240 | 0.1% | |
| 7 | 195 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 1980150 | 62.5% | |
| Latin | 1188090 | 37.5% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| J | 104536 | 8.8% | |
| u | 102670 | 8.6% | |
| a | 98496 | 8.3% | |
| e | 85443 | 7.2% | |
| c | 71212 | 6.0% | |
| A | 66039 | 5.6% | |
| r | 65142 | 5.5% | |
| n | 64822 | 5.5% | |
| M | 63814 | 5.4% | |
| p | 60842 | 5.1% | |
| O | 42130 | 3.5% | |
| t | 42130 | 3.5% | |
| l | 39714 | 3.3% | |
| N | 34068 | 2.9% | |
| o | 34068 | 2.9% | |
| v | 34068 | 2.9% | |
| g | 32816 | 2.8% | |
| y | 31895 | 2.7% | |
| D | 29082 | 2.4% | |
| F | 28742 | 2.4% | |
| b | 28742 | 2.4% | |
| S | 27619 | 2.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 2 | 437232 | 22.1% | |
| 0 | 410549 | 20.7% | |
| 1 | 408204 | 20.6% | |
| - | 396030 | 20.0% | |
| 4 | 102860 | 5.2% | |
| 3 | 97662 | 4.9% | |
| 5 | 94264 | 4.8% | |
| 6 | 28088 | 1.4% | |
| 9 | 3826 | 0.2% | |
| 8 | 1240 | 0.1% | |
| 7 | 195 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 3168240 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 2 | 437232 | 13.8% | |
| 0 | 410549 | 13.0% | |
| 1 | 408204 | 12.9% | |
| - | 396030 | 12.5% | |
| J | 104536 | 3.3% | |
| 4 | 102860 | 3.2% | |
| u | 102670 | 3.2% | |
| a | 98496 | 3.1% | |
| 3 | 97662 | 3.1% | |
| 5 | 94264 | 3.0% | |
| e | 85443 | 2.7% | |
| c | 71212 | 2.2% | |
| A | 66039 | 2.1% | |
| r | 65142 | 2.1% | |
| n | 64822 | 2.0% | |
| M | 63814 | 2.0% | |
| p | 60842 | 1.9% | |
| O | 42130 | 1.3% | |
| t | 42130 | 1.3% | |
| l | 39714 | 1.3% | |
| N | 34068 | 1.1% | |
| o | 34068 | 1.1% | |
| v | 34068 | 1.1% | |
| g | 32816 | 1.0% | |
| y | 31895 | 1.0% | |
| Other values (8) | 147534 | 4.7% |
loan_status
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
Common values
| Value | Count | Frequency (%) | |
| 0 | 318357 | 80.4% | |
| 1 | 77673 | 19.6% |
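The two classes are imbalanced at roughly 80/20. If this column were used as a model target, one common option is to weight classes inversely to their frequency. The sketch below applies the widely used "balanced" heuristic, `n_samples / (n_classes * class_count)`, to the counts above. It is an illustration only: the report does not say what 0 and 1 denote or how the column is used downstream.

```python
# Class counts copied from the loan_status frequency table.
counts = {0: 318357, 1: 77673}

total = sum(counts.values())
n_classes = len(counts)

# "Balanced" weighting: rarer classes receive proportionally larger weights.
weights = {label: total / (n_classes * n) for label, n in counts.items()}
shares = {label: n / total for label, n in counts.items()}

for label in counts:
    print(label, f"share={shares[label]:.3f}", f"weight={weights[label]:.3f}")
```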
purpose
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
Common values
| Value | Count | Frequency (%) | |
| debt_consolidation | 234507 | 59.2% | |
| credit_card | 83019 | 21.0% | |
| home_improvement | 24030 | 6.1% | |
| other | 21185 | 5.3% | |
| major_purchase | 8790 | 2.2% | |
| small_business | 5701 | 1.4% | |
| car | 4697 | 1.2% | |
| medical | 4196 | 1.1% | |
| moving | 2854 | 0.7% | |
| vacation | 2452 | 0.6% | |
| house | 2201 | 0.6% | |
| wedding | 1812 | 0.5% | |
| renewable_energy | 329 | 0.1% | |
| educational | 257 | 0.1% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 14.99784612 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| o | 789320 | 13.3% | |
| d | 643129 | 10.8% | |
| t | 599957 | 10.1% | |
| i | 593335 | 10.0% | |
| n | 506778 | 8.5% | |
| e | 435403 | 7.3% | |
| c | 420937 | 7.1% | |
| _ | 356376 | 6.0% | |
| a | 355447 | 6.0% | |
| s | 268302 | 4.5% | |
| l | 250691 | 4.2% | |
| b | 240537 | 4.0% | |
| r | 234188 | 3.9% | |
| m | 93631 | 1.6% | |
| h | 56206 | 0.9% | |
| p | 32820 | 0.6% | |
| v | 29336 | 0.5% | |
| u | 16949 | 0.3% | |
| j | 8790 | 0.1% | |
| g | 4995 | 0.1% | |
| w | 2141 | < 0.1% | |
| y | 329 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 5583221 | 94.0% | |
| Connector Punctuation | 356376 | 6.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| o | 789320 | 14.1% | |
| d | 643129 | 11.5% | |
| t | 599957 | 10.7% | |
| i | 593335 | 10.6% | |
| n | 506778 | 9.1% | |
| e | 435403 | 7.8% | |
| c | 420937 | 7.5% | |
| a | 355447 | 6.4% | |
| s | 268302 | 4.8% | |
| l | 250691 | 4.5% | |
| b | 240537 | 4.3% | |
| r | 234188 | 4.2% | |
| m | 93631 | 1.7% | |
| h | 56206 | 1.0% | |
| p | 32820 | 0.6% | |
| v | 29336 | 0.5% | |
| u | 16949 | 0.3% | |
| j | 8790 | 0.2% | |
| g | 4995 | 0.1% | |
| w | 2141 | < 0.1% | |
| y | 329 | < 0.1% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 356376 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 5583221 | 94.0% | |
| Common | 356376 | 6.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| o | 789320 | 14.1% | |
| d | 643129 | 11.5% | |
| t | 599957 | 10.7% | |
| i | 593335 | 10.6% | |
| n | 506778 | 9.1% | |
| e | 435403 | 7.8% | |
| c | 420937 | 7.5% | |
| a | 355447 | 6.4% | |
| s | 268302 | 4.8% | |
| l | 250691 | 4.5% | |
| b | 240537 | 4.3% | |
| r | 234188 | 4.2% | |
| m | 93631 | 1.7% | |
| h | 56206 | 1.0% | |
| p | 32820 | 0.6% | |
| v | 29336 | 0.5% | |
| u | 16949 | 0.3% | |
| j | 8790 | 0.2% | |
| g | 4995 | 0.1% | |
| w | 2141 | < 0.1% | |
| y | 329 | < 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| _ | 356376 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 5939597 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| o | 789320 | 13.3% | |
| d | 643129 | 10.8% | |
| t | 599957 | 10.1% | |
| i | 593335 | 10.0% | |
| n | 506778 | 8.5% | |
| e | 435403 | 7.3% | |
| c | 420937 | 7.1% | |
| _ | 356376 | 6.0% | |
| a | 355447 | 6.0% | |
| s | 268302 | 4.5% | |
| l | 250691 | 4.2% | |
| b | 240537 | 4.0% | |
| r | 234188 | 3.9% | |
| m | 93631 | 1.6% | |
| h | 56206 | 0.9% | |
| p | 32820 | 0.6% | |
| v | 29336 | 0.5% | |
| u | 16949 | 0.3% | |
| j | 8790 | 0.1% | |
| g | 4995 | 0.1% | |
| w | 2141 | < 0.1% | |
| y | 329 | < 0.1% |
title
Categorical
| Distinct | 48817 |
|---|---|
| Distinct (%) | 12.4% |
| Missing | 1755 |
| Missing (%) | 0.4% |
| Memory size | 3.0 MiB |
Common values
| Value | Count | Frequency (%) | |
| Debt consolidation | 152472 | 38.5% | |
| Credit card refinancing | 51487 | 13.0% | |
| Home improvement | 15264 | 3.9% | |
| Other | 12930 | 3.3% | |
| Debt Consolidation | 11608 | 2.9% | |
| Major purchase | 4769 | 1.2% | |
| Consolidation | 3852 | 1.0% | |
| debt consolidation | 3547 | 0.9% | |
| Business | 2949 | 0.7% | |
| Debt Consolidation Loan | 2864 | 0.7% | |
| Medical expenses | 2742 | 0.7% | |
| Car financing | 2139 | 0.5% | |
| Credit Card Consolidation | 1775 | 0.4% | |
| Vacation | 1717 | 0.4% | |
| Moving and relocation | 1689 | 0.4% | |
| consolidation | 1595 | 0.4% | |
| Personal Loan | 1591 | 0.4% | |
| Consolidation Loan | 1299 | 0.3% | |
| Home Improvement | 1268 | 0.3% | |
| Home buying | 1183 | 0.3% | |
| Credit Card Refinance | 1094 | 0.3% | |
| Credit Card Payoff | 1052 | 0.3% | |
| Consolidate | 919 | 0.2% | |
| Personal | 858 | 0.2% | |
| Loan | 751 | 0.2% | |
| Other values (48792) | 110861 | 28.0% | |
| (Missing) | 1755 | 0.4% |
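Several of the most frequent titles differ only in capitalization ("Debt consolidation", "Debt Consolidation", "debt consolidation"), which inflates the distinct count. A minimal sketch of merging such variants by case-folding and collapsing whitespace, using a handful of counts copied from the table above (`normalize` is a hypothetical helper, and the report itself performs no such cleaning):

```python
from collections import Counter

# A few rows copied from the title frequency table.
raw_counts = {
    "Debt consolidation": 152472,
    "Debt Consolidation": 11608,
    "debt consolidation": 3547,
    "Home improvement": 15264,
    "Home Improvement": 1268,
    "Consolidation": 3852,
    "consolidation": 1595,
}

def normalize(title: str) -> str:
    """Collapse runs of whitespace and ignore case."""
    return " ".join(title.split()).casefold()

merged = Counter()
for title, n in raw_counts.items():
    merged[normalize(title)] += n

for title, n in merged.most_common():
    print(title, n)
```

Near-synonyms such as "Debt Consolidation Loan" or "Consolidate" are not merged by this step and would need a separate mapping.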
Unique
| Unique | 41798 |
|---|---|
| Unique (%) | 10.6% |
Length
| Max length | 80 |
|---|---|
| Median length | 18 |
| Mean length | 17.17798399 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| o | 735791 | 10.8% | |
| n | 686361 | 10.1% | |
| i | 655694 | 9.6% | |
| t | 545268 | 8.0% | |
| e | 521004 | 7.7% | |
| (space) | 494561 | 7.3% | |
| a | 447502 | 6.6% | |
| d | 386101 | 5.7% | |
| c | 322828 | 4.7% | |
| r | 295630 | 4.3% | |
| s | 262921 | 3.9% | |
| l | 249062 | 3.7% | |
| b | 201681 | 3.0% | |
| D | 187930 | 2.8% | |
| C | 131422 | 1.9% | |
| f | 97957 | 1.4% | |
| m | 81037 | 1.2% | |
| g | 75436 | 1.1% | |
| p | 48853 | 0.7% | |
| h | 33625 | 0.5% | |
| y | 29917 | 0.4% | |
| u | 27075 | 0.4% | |
| v | 26886 | 0.4% | |
| L | 26622 | 0.4% | |
| H | 24603 | 0.4% | |
| Other values (76) | 207230 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 5754664 | 84.6% | |
| Uppercase Letter | 527589 | 7.8% | |
| Space Separator | 494561 | 7.3% | |
| Decimal Number | 13723 | 0.2% | |
| Other Punctuation | 9147 | 0.1% | |
| Dash Punctuation | 1929 | < 0.1% | |
| Connector Punctuation | 663 | < 0.1% | |
| Close Punctuation | 209 | < 0.1% | |
| Currency Symbol | 178 | < 0.1% | |
| Open Punctuation | 163 | < 0.1% | |
| Math Symbol | 151 | < 0.1% | |
| Control | 15 | < 0.1% | |
| Modifier Symbol | 3 | < 0.1% | |
| Other Symbol | 1 | < 0.1% | |
| Other Number | 1 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| D | 187930 | 35.6% | |
| C | 131422 | 24.9% | |
| L | 26622 | 5.0% | |
| H | 24603 | 4.7% | |
| O | 23248 | 4.4% | |
| P | 18167 | 3.4% | |
| M | 17673 | 3.3% | |
| R | 13527 | 2.6% | |
| B | 11236 | 2.1% | |
| I | 9867 | 1.9% | |
| E | 8712 | 1.7% | |
| F | 8556 | 1.6% | |
| S | 8116 | 1.5% | |
| A | 8034 | 1.5% | |
| T | 8033 | 1.5% | |
| N | 7381 | 1.4% | |
| G | 3701 | 0.7% | |
| W | 2967 | 0.6% | |
| V | 2957 | 0.6% | |
| Y | 1543 | 0.3% | |
| U | 1394 | 0.3% | |
| K | 827 | 0.2% | |
| J | 647 | 0.1% | |
| X | 182 | < 0.1% | |
| Q | 128 | < 0.1% | |
| Other values (2) | 116 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| o | 735791 | 12.8% | |
| n | 686361 | 11.9% | |
| i | 655694 | 11.4% | |
| t | 545268 | 9.5% | |
| e | 521004 | 9.1% | |
| a | 447502 | 7.8% | |
| d | 386101 | 6.7% | |
| c | 322828 | 5.6% | |
| r | 295630 | 5.1% | |
| s | 262921 | 4.6% | |
| l | 249062 | 4.3% | |
| b | 201681 | 3.5% | |
| f | 97957 | 1.7% | |
| m | 81037 | 1.4% | |
| g | 75436 | 1.3% | |
| p | 48853 | 0.8% | |
| h | 33625 | 0.6% | |
| y | 29917 | 0.5% | |
| u | 27075 | 0.5% | |
| v | 26886 | 0.5% | |
| w | 7125 | 0.1% | |
| j | 5925 | 0.1% | |
| x | 5731 | 0.1% | |
| k | 4482 | 0.1% | |
| q | 427 | < 0.1% | |
| Other values (2) | 345 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| (space) | 494561 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| ! | 2474 | 27.0% | |
| / | 1738 | 19.0% | |
| . | 1647 | 18.0% | |
| ' | 1040 | 11.4% | |
| , | 848 | 9.3% | |
| & | 778 | 8.5% | |
| % | 143 | 1.6% | |
| # | 132 | 1.4% | |
| : | 125 | 1.4% | |
| " | 108 | 1.2% | |
| ? | 38 | 0.4% | |
| ; | 35 | 0.4% | |
| * | 25 | 0.3% | |
| @ | 10 | 0.1% | |
| \ | 6 | 0.1% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 4028 | 29.4% | |
| 2 | 3446 | 25.1% | |
| 0 | 3203 | 23.3% | |
| 3 | 1306 | 9.5% | |
| 4 | 387 | 2.8% | |
| 5 | 364 | 2.7% | |
| 9 | 317 | 2.3% | |
| 6 | 291 | 2.1% | |
| 7 | 193 | 1.4% | |
| 8 | 188 | 1.4% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 1929 | 100.0% |
Most frequent Control characters
| Value | Count | Frequency (%) | |
| (control character) | 11 | 73.3% | |
| (control character) | 2 | 13.3% | |
| (control character) | 1 | 6.7% | |
| (control character) | 1 | 6.7% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 663 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 158 | 96.9% | |
| [ | 5 | 3.1% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 205 | 98.1% | |
| ] | 4 | 1.9% |
Most frequent Currency Symbol characters
| Value | Count | Frequency (%) | |
| $ | 178 | 100.0% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| + | 103 | 68.2% | |
| = | 17 | 11.3% | |
| ~ | 9 | 6.0% | |
| < | 9 | 6.0% | |
| > | 8 | 5.3% | |
| | | 5 | 3.3% |
Most frequent Modifier Symbol characters
| Value | Count | Frequency (%) | |
| ` | 2 | 66.7% | |
| ^ | 1 | 33.3% |
Most frequent Other Symbol characters
| Value | Count | Frequency (%) | |
| ¦ | 1 | 100.0% |
Most frequent Other Number characters
| Value | Count | Frequency (%) | |
| ³ | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 6282253 | 92.3% | |
| Common | 520744 | 7.7% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| o | 735791 | 11.7% | |
| n | 686361 | 10.9% | |
| i | 655694 | 10.4% | |
| t | 545268 | 8.7% | |
| e | 521004 | 8.3% | |
| a | 447502 | 7.1% | |
| d | 386101 | 6.1% | |
| c | 322828 | 5.1% | |
| r | 295630 | 4.7% | |
| s | 262921 | 4.2% | |
| l | 249062 | 4.0% | |
| b | 201681 | 3.2% | |
| D | 187930 | 3.0% | |
| C | 131422 | 2.1% | |
| f | 97957 | 1.6% | |
| m | 81037 | 1.3% | |
| g | 75436 | 1.2% | |
| p | 48853 | 0.8% | |
| h | 33625 | 0.5% | |
| y | 29917 | 0.5% | |
| u | 27075 | 0.4% | |
| v | 26886 | 0.4% | |
| L | 26622 | 0.4% | |
| H | 24603 | 0.4% | |
| O | 23248 | 0.4% | |
| Other values (29) | 157799 | 2.5% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| (space) | 494561 | 95.0% | |
| 1 | 4028 | 0.8% | |
| 2 | 3446 | 0.7% | |
| 0 | 3203 | 0.6% | |
| ! | 2474 | 0.5% | |
| - | 1929 | 0.4% | |
| / | 1738 | 0.3% | |
| . | 1647 | 0.3% | |
| 3 | 1306 | 0.3% | |
| ' | 1040 | 0.2% | |
| , | 848 | 0.2% | |
| & | 778 | 0.1% | |
| _ | 663 | 0.1% | |
| 4 | 387 | 0.1% | |
| 5 | 364 | 0.1% | |
| 9 | 317 | 0.1% | |
| 6 | 291 | 0.1% | |
| ) | 205 | < 0.1% | |
| 7 | 193 | < 0.1% | |
| 8 | 188 | < 0.1% | |
| $ | 178 | < 0.1% | |
| ( | 158 | < 0.1% | |
| % | 143 | < 0.1% | |
| # | 132 | < 0.1% | |
| : | 125 | < 0.1% | |
| Other values (22) | 402 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 6802988 | > 99.9% | |
| None | 9 | < 0.1% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| o | 735791 | 10.8% | |
| n | 686361 | 10.1% | |
| i | 655694 | 9.6% | |
| t | 545268 | 8.0% | |
| e | 521004 | 7.7% | |
| (space) | 494561 | 7.3% | |
| a | 447502 | 6.6% | |
| d | 386101 | 5.7% | |
| c | 322828 | 4.7% | |
| r | 295630 | 4.3% | |
| s | 262921 | 3.9% | |
| l | 249062 | 3.7% | |
| b | 201681 | 3.0% | |
| D | 187930 | 2.8% | |
| C | 131422 | 1.9% | |
| f | 97957 | 1.4% | |
| m | 81037 | 1.2% | |
| g | 75436 | 1.1% | |
| p | 48853 | 0.7% | |
| h | 33625 | 0.5% | |
| y | 29917 | 0.4% | |
| u | 27075 | 0.4% | |
| v | 26886 | 0.4% | |
| L | 26622 | 0.4% | |
| H | 24603 | 0.4% | |
| Other values (69) | 207221 | 3.0% |
Most frequent None characters
| Value | Count | Frequency (%) | |
| â | 2 | 22.2% | |
| (non-printable character) | 2 | 22.2% | |
| (non-printable character) | 1 | 11.1% | |
| (non-printable character) | 1 | 11.1% | |
| ¦ | 1 | 11.1% | |
| Ã | 1 | 11.1% | |
| ³ | 1 | 11.1% |
dti
Real number (ℝ≥0)
| Distinct | 4262 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.37951365 |
|---|---|
| Minimum | 0 |
| Maximum | 9999 |
| Zeros | 313 |
| Zeros (%) | 0.1% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4.68 |
| Q1 | 11.28 |
| median | 16.91 |
| Q3 | 22.98 |
| 95-th percentile | 31.58 |
| Maximum | 9999 |
| Range | 9999 |
| Interquartile range (IQR) | 11.7 |
Descriptive statistics
| Standard deviation | 18.01909234 |
|---|---|
| Coefficient of variation (CV) | 1.036800725 |
| Kurtosis | 237923.6765 |
| Mean | 17.37951365 |
| Median Absolute Deviation (MAD) | 5.83 |
| Skewness | 431.0512254 |
| Sum | 6882808.79 |
| Variance | 324.6876889 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) | |
| 0 | 313 | 0.1% | |
| 14.4 | 310 | 0.1% | |
| 19.2 | 302 | 0.1% | |
| 16.8 | 301 | 0.1% | |
| 18 | 300 | 0.1% | |
| 20.4 | 296 | 0.1% | |
| 12 | 293 | 0.1% | |
| 13.2 | 291 | 0.1% | |
| 21.6 | 270 | 0.1% | |
| 15.6 | 266 | 0.1% | |
| 11.52 | 254 | 0.1% | |
| 10.8 | 247 | 0.1% | |
| 22.8 | 245 | 0.1% | |
| 12.48 | 245 | 0.1% | |
| 9.6 | 243 | 0.1% | |
| 17.76 | 238 | 0.1% | |
| 12.72 | 237 | 0.1% | |
| 13.68 | 233 | 0.1% | |
| 15.84 | 233 | 0.1% | |
| 16.2 | 233 | 0.1% | |
| 16.32 | 230 | 0.1% | |
| 13.92 | 225 | 0.1% | |
| 18.48 | 224 | 0.1% | |
| 20.88 | 224 | 0.1% | |
| 19.92 | 224 | 0.1% | |
| Other values (4237) | 389553 | 98.4% |
Minimum 10 values
| Value | Count | Frequency (%) | |
| 0 | 313 | 0.1% | |
| 0.01 | 8 | < 0.1% | |
| 0.02 | 12 | < 0.1% | |
| 0.03 | 5 | < 0.1% | |
| 0.04 | 5 | < 0.1% | |
| 0.05 | 6 | < 0.1% | |
| 0.06 | 7 | < 0.1% | |
| 0.07 | 7 | < 0.1% | |
| 0.08 | 8 | < 0.1% | |
| 0.09 | 4 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) | |
| 9999 | 1 | < 0.1% | |
| 1622 | 1 | < 0.1% | |
| 380.53 | 1 | < 0.1% | |
| 189.9 | 1 | < 0.1% | |
| 145.65 | 1 | < 0.1% | |
| 138.03 | 1 | < 0.1% | |
| 120.66 | 1 | < 0.1% | |
| 107.55 | 1 | < 0.1% | |
| 93.86 | 1 | < 0.1% | |
| 92.13 | 1 | < 0.1% |
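The largest dti values lie far above the bulk of the distribution (95th percentile 31.58), and a handful of such points is enough to drive the skewness of 431 reported above. One conventional way to flag them is Tukey's 1.5 × IQR rule. The sketch below applies that rule to the quartiles and the ten largest values from the tables. The rule is an assumption chosen for illustration, not something the report applies, and the report does not say whether 9999 is a genuine value or a placeholder.

```python
# Quartiles copied from the dti quantile statistics.
q1, q3 = 11.28, 22.98
iqr = q3 - q1

# Tukey's upper fence: values above it are flagged as potential outliers.
upper_fence = q3 + 1.5 * iqr

# The ten largest dti values listed in the table above.
largest = [9999, 1622, 380.53, 189.9, 145.65, 138.03, 120.66, 107.55, 93.86, 92.13]
flagged = [v for v in largest if v > upper_fence]

print(f"upper fence = {upper_fence:.2f}; flagged {len(flagged)} of {len(largest)}")
```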
earliest_cr_line
Categorical
| Distinct | 684 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
Common values
| Value | Count | Frequency (%) | |
| Oct-2000 | 3017 | 0.8% | |
| Aug-2000 | 2935 | 0.7% | |
| Oct-2001 | 2896 | 0.7% | |
| Aug-2001 | 2884 | 0.7% | |
| Nov-2000 | 2736 | 0.7% | |
| Oct-1999 | 2726 | 0.7% | |
| Nov-1999 | 2700 | 0.7% | |
| Sep-2000 | 2691 | 0.7% | |
| Oct-2002 | 2640 | 0.7% | |
| Aug-2002 | 2599 | 0.7% | |
| Sep-2001 | 2565 | 0.6% | |
| Aug-1999 | 2548 | 0.6% | |
| Sep-1999 | 2530 | 0.6% | |
| Sep-2002 | 2530 | 0.6% | |
| Dec-2000 | 2508 | 0.6% | |
| Sep-2003 | 2491 | 0.6% | |
| Dec-1999 | 2479 | 0.6% | |
| Oct-2003 | 2439 | 0.6% | |
| Nov-2001 | 2432 | 0.6% | |
| Dec-2001 | 2423 | 0.6% | |
| Jul-2001 | 2416 | 0.6% | |
| Jul-2000 | 2369 | 0.6% | |
| Jan-2001 | 2334 | 0.6% | |
| May-2001 | 2334 | 0.6% | |
| Dec-1998 | 2329 | 0.6% | |
| Other values (659) | 331479 | 83.7% |
Unique
| Unique | 45 |
|---|---|
| Unique (%) | < 0.1% |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 416557 | 13.1% | |
| 9 | 402384 | 12.7% | |
| - | 396030 | 12.5% | |
| 1 | 253612 | 8.0% | |
| 2 | 228260 | 7.2% | |
| e | 100403 | 3.2% | |
| u | 99766 | 3.1% | |
| J | 93111 | 2.9% | |
| a | 92756 | 2.9% | |
| 8 | 78297 | 2.5% | |
| c | 71978 | 2.3% | |
| p | 66904 | 2.1% | |
| A | 66580 | 2.1% | |
| M | 62062 | 2.0% | |
| n | 61139 | 1.9% | |
| r | 60848 | 1.9% | |
| 7 | 44922 | 1.4% | |
| 4 | 40809 | 1.3% | |
| 6 | 40321 | 1.3% | |
| 3 | 39568 | 1.2% | |
| 5 | 39390 | 1.2% | |
| O | 38291 | 1.2% | |
| t | 38291 | 1.2% | |
| S | 37673 | 1.2% | |
| g | 37349 | 1.2% | |
| Other values (8) | 260939 | 8.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 1584120 | 50.0% | |
| Lowercase Letter | 792060 | 25.0% | |
| Uppercase Letter | 396030 | 12.5% | |
| Dash Punctuation | 396030 | 12.5% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| J | 93111 | 23.5% | |
| A | 66580 | 16.8% | |
| M | 62062 | 15.7% | |
| O | 38291 | 9.7% | |
| S | 37673 | 9.5% | |
| N | 35583 | 9.0% | |
| D | 33687 | 8.5% | |
| F | 29043 | 7.3% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 100403 | 12.7% | |
| u | 99766 | 12.6% | |
| a | 92756 | 11.7% | |
| c | 71978 | 9.1% | |
| p | 66904 | 8.4% | |
| n | 61139 | 7.7% | |
| r | 60848 | 7.7% | |
| t | 38291 | 4.8% | |
| g | 37349 | 4.7% | |
| o | 35583 | 4.5% | |
| v | 35583 | 4.5% | |
| l | 31972 | 4.0% | |
| y | 30445 | 3.8% | |
| b | 29043 | 3.7% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 396030 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 416557 | 26.3% | |
| 9 | 402384 | 25.4% | |
| 1 | 253612 | 16.0% | |
| 2 | 228260 | 14.4% | |
| 8 | 78297 | 4.9% | |
| 7 | 44922 | 2.8% | |
| 4 | 40809 | 2.6% | |
| 6 | 40321 | 2.5% | |
| 3 | 39568 | 2.5% | |
| 5 | 39390 | 2.5% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 1980150 | 62.5% | |
| Latin | 1188090 | 37.5% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 100403 | 8.5% | |
| u | 99766 | 8.4% | |
| J | 93111 | 7.8% | |
| a | 92756 | 7.8% | |
| c | 71978 | 6.1% | |
| p | 66904 | 5.6% | |
| A | 66580 | 5.6% | |
| M | 62062 | 5.2% | |
| n | 61139 | 5.1% | |
| r | 60848 | 5.1% | |
| O | 38291 | 3.2% | |
| t | 38291 | 3.2% | |
| S | 37673 | 3.2% | |
| g | 37349 | 3.1% | |
| N | 35583 | 3.0% | |
| o | 35583 | 3.0% | |
| v | 35583 | 3.0% | |
| D | 33687 | 2.8% | |
| l | 31972 | 2.7% | |
| y | 30445 | 2.6% | |
| F | 29043 | 2.4% | |
| b | 29043 | 2.4% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 416557 | 21.0% | |
| 9 | 402384 | 20.3% | |
| - | 396030 | 20.0% | |
| 1 | 253612 | 12.8% | |
| 2 | 228260 | 11.5% | |
| 8 | 78297 | 4.0% | |
| 7 | 44922 | 2.3% | |
| 4 | 40809 | 2.1% | |
| 6 | 40321 | 2.0% | |
| 3 | 39568 | 2.0% | |
| 5 | 39390 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 3168240 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 416557 | 13.1% | |
| 9 | 402384 | 12.7% | |
| - | 396030 | 12.5% | |
| 1 | 253612 | 8.0% | |
| 2 | 228260 | 7.2% | |
| e | 100403 | 3.2% | |
| u | 99766 | 3.1% | |
| J | 93111 | 2.9% | |
| a | 92756 | 2.9% | |
| 8 | 78297 | 2.5% | |
| c | 71978 | 2.3% | |
| p | 66904 | 2.1% | |
| A | 66580 | 2.1% | |
| M | 62062 | 2.0% | |
| n | 61139 | 1.9% | |
| r | 60848 | 1.9% | |
| 7 | 44922 | 1.4% | |
| 4 | 40809 | 1.3% | |
| 6 | 40321 | 1.3% | |
| 3 | 39568 | 1.2% | |
| 5 | 39390 | 1.2% | |
| O | 38291 | 1.2% | |
| t | 38291 | 1.2% | |
| S | 37673 | 1.2% | |
| g | 37349 | 1.2% | |
| Other values (8) | 260939 | 8.2% |
open_acc
Real number (ℝ≥0)
| Distinct | 61 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.3111532 |
|---|---|
| Minimum | 0 |
| Maximum | 90 |
| Zeros | 6 |
| Zeros (%) | < 0.1% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 8 |
| median | 10 |
| Q3 | 14 |
| 95-th percentile | 21 |
| Maximum | 90 |
| Range | 90 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 5.137648808 |
|---|---|
| Coefficient of variation (CV) | 0.4542108766 |
| Kurtosis | 2.966944774 |
| Mean | 11.3111532 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.213018844 |
| Sum | 4479556 |
| Variance | 26.39543527 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) | |
| 9 | 36779 | 9.3% | |
| 10 | 35441 | 8.9% | |
| 8 | 35137 | 8.9% | |
| 11 | 32695 | 8.3% | |
| 7 | 31328 | 7.9% | |
| 12 | 29157 | 7.4% | |
| 6 | 25927 | 6.5% | |
| 13 | 24983 | 6.3% | |
| 14 | 21173 | 5.3% | |
| 5 | 18308 | 4.6% | |
| 15 | 17347 | 4.4% | |
| 16 | 14376 | 3.6% | |
| 17 | 11618 | 2.9% | |
| 4 | 10709 | 2.7% | |
| 18 | 9430 | 2.4% | |
| 19 | 7723 | 2.0% | |
| 20 | 5973 | 1.5% | |
| 3 | 4783 | 1.2% | |
| 21 | 4650 | 1.2% | |
| 22 | 3692 | 0.9% | |
| 23 | 2944 | 0.7% | |
| 24 | 2364 | 0.6% | |
| 25 | 1791 | 0.5% | |
| 2 | 1459 | 0.4% | |
| 26 | 1273 | 0.3% | |
| Other values (36) | 4970 | 1.3% |
Minimum 10 values
| Value | Count | Frequency (%) | |
| 0 | 6 | < 0.1% | |
| 1 | 85 | < 0.1% | |
| 2 | 1459 | 0.4% | |
| 3 | 4783 | 1.2% | |
| 4 | 10709 | 2.7% | |
| 5 | 18308 | 4.6% | |
| 6 | 25927 | 6.5% | |
| 7 | 31328 | 7.9% | |
| 8 | 35137 | 8.9% | |
| 9 | 36779 | 9.3% |
Maximum 10 values
| Value | Count | Frequency (%) | |
| 90 | 1 | < 0.1% | |
| 76 | 2 | < 0.1% | |
| 58 | 1 | < 0.1% | |
| 57 | 1 | < 0.1% | |
| 56 | 2 | < 0.1% | |
| 55 | 2 | < 0.1% | |
| 54 | 3 | < 0.1% | |
| 53 | 6 | < 0.1% | |
| 52 | 3 | < 0.1% | |
| 51 | 4 | < 0.1% |
pub_rec
Real number (ℝ≥0)
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1781910461 |
|---|---|
| Minimum | 0 |
| Maximum | 86 |
| Zeros | 338272 |
| Zeros (%) | 85.4% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 86 |
| Range | 86 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.5306706005 |
|---|---|
| Coefficient of variation (CV) | 2.978099136 |
| Kurtosis | 1867.466643 |
| Mean | 0.1781910461 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 16.5765642 |
| Sum | 70569 |
| Variance | 0.2816112862 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) | |
| 0 | 338272 | 85.4% | |
| 1 | 49739 | 12.6% | |
| 2 | 5476 | 1.4% | |
| 3 | 1521 | 0.4% | |
| 4 | 527 | 0.1% | |
| 5 | 237 | 0.1% | |
| 6 | 122 | < 0.1% | |
| 7 | 56 | < 0.1% | |
| 8 | 34 | < 0.1% | |
| 9 | 12 | < 0.1% | |
| 10 | 11 | < 0.1% | |
| 11 | 8 | < 0.1% | |
| 13 | 4 | < 0.1% | |
| 12 | 4 | < 0.1% | |
| 19 | 2 | < 0.1% | |
| 86 | 1 | < 0.1% | |
| 40 | 1 | < 0.1% | |
| 17 | 1 | < 0.1% | |
| 15 | 1 | < 0.1% | |
| 24 | 1 | < 0.1% |
Minimum 10 values
| Value | Count | Frequency (%) | |
| 0 | 338272 | 85.4% | |
| 1 | 49739 | 12.6% | |
| 2 | 5476 | 1.4% | |
| 3 | 1521 | 0.4% | |
| 4 | 527 | 0.1% | |
| 5 | 237 | 0.1% | |
| 6 | 122 | < 0.1% | |
| 7 | 56 | < 0.1% | |
| 8 | 34 | < 0.1% | |
| 9 | 12 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) | |
| 86 | 1 | < 0.1% | |
| 40 | 1 | < 0.1% | |
| 24 | 1 | < 0.1% | |
| 19 | 2 | < 0.1% | |
| 17 | 1 | < 0.1% | |
| 15 | 1 | < 0.1% | |
| 13 | 4 | < 0.1% | |
| 12 | 4 | < 0.1% | |
| 11 | 8 | < 0.1% | |
| 10 | 11 | < 0.1% |
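Because this variable has only 20 distinct values, its full frequency table is listed above, and the summary statistics can be recomputed from it. A short sketch that rebuilds the row count, sum, mean, and zero share from the value/count pairs (copied by hand from the table):

```python
# Value -> count pairs copied from the complete frequency table above.
freq = {
    0: 338272, 1: 49739, 2: 5476, 3: 1521, 4: 527, 5: 237, 6: 122,
    7: 56, 8: 34, 9: 12, 10: 11, 11: 8, 12: 4, 13: 4, 15: 1, 17: 1,
    19: 2, 24: 1, 40: 1, 86: 1,
}

n_rows = sum(freq.values())
total = sum(value * count for value, count in freq.items())
mean = total / n_rows
zero_share = freq[0] / n_rows

print(n_rows, total, round(mean, 10), f"{zero_share:.1%}")
```

The median and all quartiles are 0 because more than 75% of rows are zero, so the IQR and MAD carry no information for this variable.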
revol_bal
Real number (ℝ≥0)
| Distinct | 55622 |
|---|---|
| Distinct (%) | 14.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15844.53985 |
|---|---|
| Minimum | 0 |
| Maximum | 1743266 |
| Zeros | 2128 |
| Zeros (%) | 0.5% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1685 |
| Q1 | 6025 |
| median | 11181 |
| Q3 | 19620 |
| 95-th percentile | 41066.55 |
| Maximum | 1743266 |
| Range | 1743266 |
| Interquartile range (IQR) | 13595 |
Descriptive statistics
| Standard deviation | 20591.83611 |
|---|---|
| Coefficient of variation (CV) | 1.299617174 |
| Kurtosis | 384.2210931 |
| Mean | 15844.53985 |
| Median Absolute Deviation (MAD) | 6112 |
| Skewness | 11.72751512 |
| Sum | 6274913118 |
| Variance | 424023714.3 |
| Monotonicity | Not monotonic |
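The derived figures in the revol_bal tables can be recomputed from the other reported values, which is a quick way to sanity-check a profiling report. The sketch below copies its inputs from the tables above and assumes the usual definitions (mean as sum over observations, CV as standard deviation over mean, IQR as Q3 minus Q1). The variable names are illustrative and not part of the report.

```python
# Inputs copied from the revol_bal tables (no missing values for this column).
n_obs = 396030          # number of observations in the dataset
total = 6274913118      # reported Sum
std = 20591.83611       # reported Standard deviation
q1, q3 = 6025, 19620    # reported Q1 and Q3
minimum, maximum = 0, 1743266

# Derived statistics, under the assumed definitions.
mean = total / n_obs
cv = std / mean
variance = std ** 2
iqr = q3 - q1
value_range = maximum - minimum

print(f"mean={mean:.5f}  cv={cv:.6f}  variance={variance:.1f}")
print(f"iqr={iqr}  range={value_range}")
```

Each recomputed value should agree with the corresponding table entry up to rounding of the printed inputs.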
| Value | Count | Frequency (%) | |
| 0 | 2128 | 0.5% | |
| 5655 | 41 | < 0.1% | |
| 6095 | 38 | < 0.1% | |
| 7792 | 38 | < 0.1% | |
| 3953 | 37 | < 0.1% | |
| 6077 | 36 | < 0.1% | |
| 5098 | 36 | < 0.1% | |
| 6521 | 35 | < 0.1% | |
| 4541 | 35 | < 0.1% | |
| 5235 | 35 | < 0.1% | |
| 10362 | 35 | < 0.1% | |
| 5789 | 35 | < 0.1% | |
| 5249 | 35 | < 0.1% | |
| 5389 | 35 | < 0.1% | |
| 8502 | 35 | < 0.1% | |
| 6444 | 34 | < 0.1% | |
| 7179 | 34 | < 0.1% | |
| 9508 | 34 | < 0.1% | |
| 3997 | 34 | < 0.1% | |
| 4808 | 34 | < 0.1% | |
| 5152 | 34 | < 0.1% | |
| 5463 | 34 | < 0.1% | |
| 7618 | 34 | < 0.1% | |
| 5514 | 34 | < 0.1% | |
| 5671 | 34 | < 0.1% | |
| Other values (55597) | 393056 | 99.2% |
| Value | Count | Frequency (%) | |
| 0 | 2128 | 0.5% | |
| 1 | 30 | < 0.1% | |
| 2 | 26 | < 0.1% | |
| 3 | 28 | < 0.1% | |
| 4 | 20 | < 0.1% | |
| 5 | 23 | < 0.1% | |
| 6 | 30 | < 0.1% | |
| 7 | 21 | < 0.1% | |
| 8 | 21 | < 0.1% | |
| 9 | 23 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1743266 | 1 | < 0.1% | |
| 1298783 | 1 | < 0.1% | |
| 1190046 | 1 | < 0.1% | |
| 1030826 | 1 | < 0.1% | |
| 1023940 | 1 | < 0.1% | |
| 975800 | 1 | < 0.1% | |
| 867528 | 1 | < 0.1% | |
| 838698 | 1 | < 0.1% | |
| 814300 | 1 | < 0.1% | |
| 778614 | 1 | < 0.1% |
revol_util
Real number (ℝ≥0)
| Distinct | 1226 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 276 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 53.79174864 |
|---|---|
| Minimum | 0 |
| Maximum | 892.3 |
| Zeros | 2213 |
| Zeros (%) | 0.6% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 11.2 |
| Q1 | 35.8 |
| median | 54.8 |
| Q3 | 72.9 |
| 95-th percentile | 92 |
| Maximum | 892.3 |
| Range | 892.3 |
| Interquartile range (IQR) | 37.1 |
Descriptive statistics
| Standard deviation | 24.45219306 |
|---|---|
| Coefficient of variation (CV) | 0.4545714479 |
| Kurtosis | 2.71227821 |
| Mean | 53.79174864 |
| Median Absolute Deviation (MAD) | 18.5 |
| Skewness | -0.07177802033 |
| Sum | 21288299.69 |
| Variance | 597.9097456 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 2213 | 0.6% | |
| 53 | 752 | 0.2% | |
| 60 | 739 | 0.2% | |
| 61 | 734 | 0.2% | |
| 55 | 730 | 0.2% | |
| 54 | 725 | 0.2% | |
| 62 | 721 | 0.2% | |
| 47 | 720 | 0.2% | |
| 57 | 719 | 0.2% | |
| 58 | 717 | 0.2% | |
| 59 | 708 | 0.2% | |
| 65 | 706 | 0.2% | |
| 63 | 701 | 0.2% | |
| 46 | 698 | 0.2% | |
| 56 | 689 | 0.2% | |
| 51 | 681 | 0.2% | |
| 49 | 679 | 0.2% | |
| 48 | 671 | 0.2% | |
| 52 | 664 | 0.2% | |
| 50 | 661 | 0.2% | |
| 64 | 657 | 0.2% | |
| 69 | 654 | 0.2% | |
| 44 | 652 | 0.2% | |
| 67 | 644 | 0.2% | |
| 41 | 638 | 0.2% | |
| Other values (1201) | 376881 | 95.2% |
| Value | Count | Frequency (%) | |
| 0 | 2213 | 0.6% | |
| 0.01 | 1 | < 0.1% | |
| 0.04 | 1 | < 0.1% | |
| 0.05 | 1 | < 0.1% | |
| 0.1 | 253 | 0.1% | |
| 0.16 | 1 | < 0.1% | |
| 0.2 | 211 | 0.1% | |
| 0.3 | 187 | < 0.1% | |
| 0.4 | 189 | < 0.1% | |
| 0.46 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 892.3 | 1 | < 0.1% | |
| 153 | 1 | < 0.1% | |
| 152.5 | 1 | < 0.1% | |
| 150.7 | 1 | < 0.1% | |
| 148 | 1 | < 0.1% | |
| 146.1 | 1 | < 0.1% | |
| 145.8 | 1 | < 0.1% | |
| 140.4 | 1 | < 0.1% | |
| 136.7 | 1 | < 0.1% | |
| 132.1 | 1 | < 0.1% |
total_acc
Real number (ℝ≥0)
| Distinct | 118 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.41474383 |
|---|---|
| Minimum | 2 |
| Maximum | 151 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 17 |
| median | 24 |
| Q3 | 32 |
| 95-th percentile | 47 |
| Maximum | 151 |
| Range | 149 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 11.88699072 |
|---|---|
| Coefficient of variation (CV) | 0.4677202651 |
| Kurtosis | 1.204620014 |
| Mean | 25.41474383 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.8643276369 |
| Sum | 10065001 |
| Variance | 141.3005485 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 21 | 14280 | 3.6% | |
| 22 | 14260 | 3.6% | |
| 20 | 14228 | 3.6% | |
| 23 | 13923 | 3.5% | |
| 24 | 13878 | 3.5% | |
| 19 | 13876 | 3.5% | |
| 18 | 13710 | 3.5% | |
| 17 | 13495 | 3.4% | |
| 25 | 13225 | 3.3% | |
| 26 | 12799 | 3.2% | |
| 16 | 12771 | 3.2% | |
| 27 | 12343 | 3.1% | |
| 15 | 12283 | 3.1% | |
| 28 | 11706 | 3.0% | |
| 14 | 11524 | 2.9% | |
| 29 | 11274 | 2.8% | |
| 13 | 10936 | 2.8% | |
| 30 | 10587 | 2.7% | |
| 31 | 9869 | 2.5% | |
| 12 | 9858 | 2.5% | |
| 32 | 9552 | 2.4% | |
| 11 | 8844 | 2.2% | |
| 33 | 8682 | 2.2% | |
| 34 | 8088 | 2.0% | |
| 10 | 7672 | 1.9% | |
| Other values (93) | 102367 | 25.8% |
| Value | Count | Frequency (%) | |
| 2 | 18 | < 0.1% | |
| 3 | 327 | 0.1% | |
| 4 | 1238 | 0.3% | |
| 5 | 2028 | 0.5% | |
| 6 | 2923 | 0.7% | |
| 7 | 4143 | 1.0% | |
| 8 | 5365 | 1.4% | |
| 9 | 6362 | 1.6% | |
| 10 | 7672 | 1.9% | |
| 11 | 8844 | 2.2% |
| Value | Count | Frequency (%) | |
| 151 | 1 | < 0.1% | |
| 150 | 1 | < 0.1% | |
| 135 | 1 | < 0.1% | |
| 129 | 1 | < 0.1% | |
| 124 | 1 | < 0.1% | |
| 118 | 1 | < 0.1% | |
| 117 | 1 | < 0.1% | |
| 116 | 2 | < 0.1% | |
| 115 | 1 | < 0.1% | |
| 111 | 2 | < 0.1% |
initial_list_status
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Value | Count | Frequency (%) | |
| f | 238066 | 60.1% | |
| w | 157964 | 39.9% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| f | 238066 | 60.1% | |
| w | 157964 | 39.9% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 396030 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| f | 238066 | 60.1% | |
| w | 157964 | 39.9% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 396030 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| f | 238066 | 60.1% | |
| w | 157964 | 39.9% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 396030 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| f | 238066 | 60.1% | |
| w | 157964 | 39.9% |
application_type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Value | Count | Frequency (%) | |
| INDIVIDUAL | 395319 | 99.8% | |
| JOINT | 425 | 0.1% | |
| DIRECT_PAY | 286 | 0.1% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.994634245 |
| Min length | 5 |
Most occurring characters
| Value | Count | Frequency (%) | |
| I | 1186668 | 30.0% | |
| D | 790924 | 20.0% | |
| N | 395744 | 10.0% | |
| A | 395605 | 10.0% | |
| V | 395319 | 10.0% | |
| U | 395319 | 10.0% | |
| L | 395319 | 10.0% | |
| T | 711 | < 0.1% | |
| J | 425 | < 0.1% | |
| O | 425 | < 0.1% | |
| R | 286 | < 0.1% | |
| E | 286 | < 0.1% | |
| C | 286 | < 0.1% | |
| _ | 286 | < 0.1% | |
| P | 286 | < 0.1% | |
| Y | 286 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 3957889 | > 99.9% | |
| Connector Punctuation | 286 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| I | 1186668 | 30.0% | |
| D | 790924 | 20.0% | |
| N | 395744 | 10.0% | |
| A | 395605 | 10.0% | |
| V | 395319 | 10.0% | |
| U | 395319 | 10.0% | |
| L | 395319 | 10.0% | |
| T | 711 | < 0.1% | |
| J | 425 | < 0.1% | |
| O | 425 | < 0.1% | |
| R | 286 | < 0.1% | |
| E | 286 | < 0.1% | |
| C | 286 | < 0.1% | |
| P | 286 | < 0.1% | |
| Y | 286 | < 0.1% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 286 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 3957889 | > 99.9% | |
| Common | 286 | < 0.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| I | 1186668 | 30.0% | |
| D | 790924 | 20.0% | |
| N | 395744 | 10.0% | |
| A | 395605 | 10.0% | |
| V | 395319 | 10.0% | |
| U | 395319 | 10.0% | |
| L | 395319 | 10.0% | |
| T | 711 | < 0.1% | |
| J | 425 | < 0.1% | |
| O | 425 | < 0.1% | |
| R | 286 | < 0.1% | |
| E | 286 | < 0.1% | |
| C | 286 | < 0.1% | |
| P | 286 | < 0.1% | |
| Y | 286 | < 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| _ | 286 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 3958175 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| I | 1186668 | 30.0% | |
| D | 790924 | 20.0% | |
| N | 395744 | 10.0% | |
| A | 395605 | 10.0% | |
| V | 395319 | 10.0% | |
| U | 395319 | 10.0% | |
| L | 395319 | 10.0% | |
| T | 711 | < 0.1% | |
| J | 425 | < 0.1% | |
| O | 425 | < 0.1% | |
| R | 286 | < 0.1% | |
| E | 286 | < 0.1% | |
| C | 286 | < 0.1% | |
| _ | 286 | < 0.1% | |
| P | 286 | < 0.1% | |
| Y | 286 | < 0.1% |
mort_acc
Real number (ℝ≥0)
| Distinct | 33 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 37795 |
| Missing (%) | 9.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.813990816 |
|---|---|
| Minimum | 0 |
| Maximum | 34 |
| Zeros | 139777 |
| Zeros (%) | 35.3% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 6 |
| Maximum | 34 |
| Range | 34 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.147930467 |
|---|---|
| Coefficient of variation (CV) | 1.184091148 |
| Kurtosis | 4.477175726 |
| Mean | 1.813990816 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.600132438 |
| Sum | 649835 |
| Variance | 4.613605292 |
| Monotonicity | Not monotonic |
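For mort_acc, which has 37795 missing values, the reported figures use two different denominators. The sketch below, with inputs copied from the tables above and illustrative variable names, suggests that the mean is taken over non-missing values only, while the Missing (%) and Zeros (%) figures are relative to all observations. This is inferred from the numbers in this report, not from the profiling library's source.

```python
# Inputs copied from the mort_acc tables and the dataset overview.
n_obs = 396030       # total observations
n_missing = 37795    # reported Missing
n_zeros = 139777     # reported Zeros
total = 649835       # reported Sum

n_valid = n_obs - n_missing

mean_over_valid = total / n_valid   # matches the reported Mean
mean_over_all = total / n_obs       # would understate the mean

missing_pct = 100 * n_missing / n_obs   # share of all rows
zeros_pct = 100 * n_zeros / n_obs       # share of all rows, not of valid rows

print(f"mean over valid rows: {mean_over_valid:.6f}")
print(f"mean over all rows:   {mean_over_all:.6f}")
print(f"missing: {missing_pct:.1f}%  zeros: {zeros_pct:.1f}%")
```

When imputing or filtering this column, it helps to keep in mind that the zero share among rows that actually have a value is higher than the 35.3% shown.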
| Value | Count | Frequency (%) | |
| 0 | 139777 | 35.3% | |
| 1 | 60416 | 15.3% | |
| 2 | 49948 | 12.6% | |
| 3 | 38049 | 9.6% | |
| 4 | 27887 | 7.0% | |
| 5 | 18194 | 4.6% | |
| 6 | 11069 | 2.8% | |
| 7 | 6052 | 1.5% | |
| 8 | 3121 | 0.8% | |
| 9 | 1656 | 0.4% | |
| 10 | 865 | 0.2% | |
| 11 | 479 | 0.1% | |
| 12 | 264 | 0.1% | |
| 13 | 146 | < 0.1% | |
| 14 | 107 | < 0.1% | |
| 15 | 61 | < 0.1% | |
| 16 | 37 | < 0.1% | |
| 17 | 22 | < 0.1% | |
| 18 | 18 | < 0.1% | |
| 19 | 15 | < 0.1% | |
| 20 | 13 | < 0.1% | |
| 24 | 10 | < 0.1% | |
| 22 | 7 | < 0.1% | |
| 21 | 4 | < 0.1% | |
| 25 | 4 | < 0.1% | |
| Other values (8) | 14 | < 0.1% | |
| (Missing) | 37795 | 9.5% |
| Value | Count | Frequency (%) | |
| 0 | 139777 | 35.3% | |
| 1 | 60416 | 15.3% | |
| 2 | 49948 | 12.6% | |
| 3 | 38049 | 9.6% | |
| 4 | 27887 | 7.0% | |
| 5 | 18194 | 4.6% | |
| 6 | 11069 | 2.8% | |
| 7 | 6052 | 1.5% | |
| 8 | 3121 | 0.8% | |
| 9 | 1656 | 0.4% |
| Value | Count | Frequency (%) | |
| 34 | 1 | < 0.1% | |
| 32 | 2 | < 0.1% | |
| 31 | 2 | < 0.1% | |
| 30 | 1 | < 0.1% | |
| 28 | 1 | < 0.1% | |
| 27 | 3 | < 0.1% | |
| 26 | 2 | < 0.1% | |
| 25 | 4 | < 0.1% | |
| 24 | 10 | < 0.1% | |
| 23 | 2 | < 0.1% |
pub_rec_bankruptcies
Real number (ℝ≥0)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 535 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1216475556 |
|---|---|
| Minimum | 0 |
| Maximum | 8 |
| Zeros | 350380 |
| Zeros (%) | 88.5% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3561742766 |
|---|---|
| Coefficient of variation (CV) | 2.927919718 |
| Kurtosis | 18.10416044 |
| Mean | 0.1216475556 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.423440368 |
| Sum | 48111 |
| Variance | 0.1268601153 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 350380 | 88.5% | |
| 1 | 42790 | 10.8% | |
| 2 | 1847 | 0.5% | |
| 3 | 351 | 0.1% | |
| 4 | 82 | < 0.1% | |
| 5 | 32 | < 0.1% | |
| 6 | 7 | < 0.1% | |
| 7 | 4 | < 0.1% | |
| 8 | 2 | < 0.1% | |
| (Missing) | 535 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 350380 | 88.5% | |
| 1 | 42790 | 10.8% | |
| 2 | 1847 | 0.5% | |
| 3 | 351 | 0.1% | |
| 4 | 82 | < 0.1% | |
| 5 | 32 | < 0.1% | |
| 6 | 7 | < 0.1% | |
| 7 | 4 | < 0.1% | |
| 8 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 8 | 2 | < 0.1% | |
| 7 | 4 | < 0.1% | |
| 6 | 7 | < 0.1% | |
| 5 | 32 | < 0.1% | |
| 4 | 82 | < 0.1% | |
| 3 | 351 | 0.1% | |
| 2 | 1847 | 0.5% | |
| 1 | 42790 | 10.8% | |
| 0 | 350380 | 88.5% |
address
Categorical
| Distinct | 393700 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Value | Count | Frequency (%) | |
| USS Johnson FPO AE 48052 | 8 | < 0.1% | |
| USCGC Smith FPO AE 70466 | 8 | < 0.1% | |
| USNS Johnson FPO AE 05113 | 8 | < 0.1% | |
| USS Smith FPO AP 70466 | 8 | < 0.1% | |
| USNS Johnson FPO AP 48052 | 7 | < 0.1% | |
| USS Smith FPO AP 22690 | 6 | < 0.1% | |
| USCGC Smith FPO AA 70466 | 6 | < 0.1% | |
| USNV Smith FPO AE 30723 | 6 | < 0.1% | |
| USNV Brown FPO AA 48052 | 6 | < 0.1% | |
| USCGC Jones FPO AE 22690 | 6 | < 0.1% | |
| USNS Johnson FPO AA 70466 | 6 | < 0.1% | |
| USCGC Miller FPO AA 22690 | 6 | < 0.1% | |
| USNV Smith FPO AA 00813 | 6 | < 0.1% | |
| USCGC Williams FPO AE 00813 | 5 | < 0.1% | |
| USS Smith FPO AA 70466 | 5 | < 0.1% | |
| USCGC Smith FPO AE 48052 | 5 | < 0.1% | |
| USCGC Smith FPO AE 00813 | 5 | < 0.1% | |
| USS Williams FPO AA 30723 | 5 | < 0.1% | |
| USCGC Lee FPO AA 22690 | 5 | < 0.1% | |
| USNV Jones FPO AE 22690 | 5 | < 0.1% | |
| USNV Lewis FPO AE 29597 | 5 | < 0.1% | |
| USNS Smith FPO AE 48052 | 5 | < 0.1% | |
| USCGC Smith FPO AE 22690 | 5 | < 0.1% | |
| USNS Williams FPO AA 48052 | 5 | < 0.1% | |
| USNS Brown FPO AP 29597 | 5 | < 0.1% | |
| Other values (393675) | 395883 | > 99.9% |
Unique
| Unique | 391984 |
|---|---|
| Unique (%) | 99.0% |
Length
| Max length | 69 |
|---|---|
| Median length | 45 |
| Mean length | 44.71395096 |
| Min length | 20 |
Most occurring characters
| Value | Count | Frequency (%) | |
| (space) | 2128626 | 12.0% | |
| e | 911545 | 5.1% | |
| a | 735427 | 4.2% | |
| t | 702787 | 4.0% | |
| r | 656748 | 3.7% | |
| 0 | 624825 | 3.5% | |
| i | 580043 | 3.3% | |
| o | 579480 | 3.3% | |
| n | 551350 | 3.1% | |
| 2 | 487525 | 2.8% | |
| s | 471608 | 2.7% | |
| 3 | 443992 | 2.5% | |
| 6 | 421262 | 2.4% | |
| l | 400273 | 2.3% | |
| \r | 396030 | 2.2% | |
| \n | 396030 | 2.2% | |
| 7 | 387522 | 2.2% | |
| 1 | 375962 | 2.1% | |
| 9 | 375452 | 2.1% | |
| 5 | 375301 | 2.1% | |
| , | 367706 | 2.1% | |
| h | 341828 | 1.9% | |
| 4 | 330279 | 1.9% | |
| 8 | 329800 | 1.9% | |
| u | 314299 | 1.8% | |
| Other values (42) | 4022366 | 22.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 7690715 | 43.4% | |
| Decimal Number | 4151920 | 23.4% | |
| Uppercase Letter | 2488639 | 14.1% | |
| Space Separator | 2128626 | 12.0% | |
| Control | 792060 | 4.5% | |
| Other Punctuation | 456106 | 2.6% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 624825 | 15.0% | |
| 2 | 487525 | 11.7% | |
| 3 | 443992 | 10.7% | |
| 6 | 421262 | 10.1% | |
| 7 | 387522 | 9.3% | |
| 1 | 375962 | 9.1% | |
| 9 | 375452 | 9.0% | |
| 5 | 375301 | 9.0% | |
| 4 | 330279 | 8.0% | |
| 8 | 329800 | 7.9% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| (space) | 2128626 | 100.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| A | 295950 | 11.9% | |
| S | 274289 | 11.0% | |
| P | 164644 | 6.6% | |
| M | 161767 | 6.5% | |
| C | 157893 | 6.3% | |
| N | 148622 | 6.0% | |
| D | 106785 | 4.3% | |
| L | 105374 | 4.2% | |
| W | 94264 | 3.8% | |
| R | 93408 | 3.8% | |
| T | 92340 | 3.7% | |
| J | 90626 | 3.6% | |
| E | 87153 | 3.5% | |
| O | 86441 | 3.5% | |
| B | 83503 | 3.4% | |
| I | 69262 | 2.8% | |
| K | 69115 | 2.8% | |
| H | 62604 | 2.5% | |
| V | 58910 | 2.4% | |
| F | 57515 | 2.3% | |
| G | 48691 | 2.0% | |
| U | 40011 | 1.6% | |
| Y | 23310 | 0.9% | |
| Z | 8879 | 0.4% | |
| X | 7103 | 0.3% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 911545 | 11.9% | |
| a | 735427 | 9.6% | |
| t | 702787 | 9.1% | |
| r | 656748 | 8.5% | |
| i | 580043 | 7.5% | |
| o | 579480 | 7.5% | |
| n | 551350 | 7.2% | |
| s | 471608 | 6.1% | |
| l | 400273 | 5.2% | |
| h | 341828 | 4.4% | |
| u | 314299 | 4.1% | |
| d | 186512 | 2.4% | |
| y | 165113 | 2.1% | |
| p | 160214 | 2.1% | |
| c | 139331 | 1.8% | |
| m | 131245 | 1.7% | |
| g | 116855 | 1.5% | |
| w | 111006 | 1.4% | |
| b | 99888 | 1.3% | |
| v | 98376 | 1.3% | |
| k | 91092 | 1.2% | |
| f | 60887 | 0.8% | |
| x | 39380 | 0.5% | |
| z | 33916 | 0.4% | |
| q | 8942 | 0.1% |
Most frequent Control characters
| Value | Count | Frequency (%) | |
| \r | 396030 | 50.0% | |
| \n | 396030 | 50.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| , | 367706 | 80.6% | |
| . | 88400 | 19.4% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 10179354 | 57.5% | |
| Common | 7528712 | 42.5% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| (space) | 2128626 | 28.3% | |
| 0 | 624825 | 8.3% | |
| 2 | 487525 | 6.5% | |
| 3 | 443992 | 5.9% | |
| 6 | 421262 | 5.6% | |
| \r | 396030 | 5.3% | |
| \n | 396030 | 5.3% | |
| 7 | 387522 | 5.1% | |
| 1 | 375962 | 5.0% | |
| 9 | 375452 | 5.0% | |
| 5 | 375301 | 5.0% | |
| , | 367706 | 4.9% | |
| 4 | 330279 | 4.4% | |
| 8 | 329800 | 4.4% | |
| . | 88400 | 1.2% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 911545 | 9.0% | |
| a | 735427 | 7.2% | |
| t | 702787 | 6.9% | |
| r | 656748 | 6.5% | |
| i | 580043 | 5.7% | |
| o | 579480 | 5.7% | |
| n | 551350 | 5.4% | |
| s | 471608 | 4.6% | |
| l | 400273 | 3.9% | |
| h | 341828 | 3.4% | |
| u | 314299 | 3.1% | |
| A | 295950 | 2.9% | |
| S | 274289 | 2.7% | |
| d | 186512 | 1.8% | |
| y | 165113 | 1.6% | |
| P | 164644 | 1.6% | |
| M | 161767 | 1.6% | |
| p | 160214 | 1.6% | |
| C | 157893 | 1.6% | |
| N | 148622 | 1.5% | |
| c | 139331 | 1.4% | |
| m | 131245 | 1.3% | |
| g | 116855 | 1.1% | |
| w | 111006 | 1.1% | |
| D | 106785 | 1.0% | |
| Other values (27) | 1613740 | 15.9% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 17708066 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| (space) | 2128626 | 12.0% | |
| e | 911545 | 5.1% | |
| a | 735427 | 4.2% | |
| t | 702787 | 4.0% | |
| r | 656748 | 3.7% | |
| 0 | 624825 | 3.5% | |
| i | 580043 | 3.3% | |
| o | 579480 | 3.3% | |
| n | 551350 | 3.1% | |
| 2 | 487525 | 2.8% | |
| s | 471608 | 2.7% | |
| 3 | 443992 | 2.5% | |
| 6 | 421262 | 2.4% | |
| l | 400273 | 2.3% | |
| \r | 396030 | 2.2% | |
| \n | 396030 | 2.2% | |
| 7 | 387522 | 2.2% | |
| 1 | 375962 | 2.1% | |
| 9 | 375452 | 2.1% | |
| 5 | 375301 | 2.1% | |
| , | 367706 | 2.1% | |
| h | 341828 | 1.9% | |
| 4 | 330279 | 1.9% | |
| 8 | 329800 | 1.9% | |
| u | 314299 | 1.8% | |
| Other values (42) | 4022366 | 22.7% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.
To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
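As a minimal sketch of that calculation, the function below divides the covariance by the product of the standard deviations (the common 1/n factors cancel, so plain sums are used). The sample values are the loan_amnt and installment columns of the first ten rows listed in this report; the function and variable names are illustrative only.

```python
import math

def pearson_r(xs, ys):
    """Pearson's r: covariance of xs and ys over the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

# First ten rows of the dataset, as shown in this report.
loan_amnt = [10000, 8000, 15600, 7200, 24375, 20000, 18000, 13000, 18900, 26300]
installment = [329.48, 265.68, 506.97, 220.65, 609.33, 677.07, 542.07, 426.47, 410.84, 928.40]

r = pearson_r(loan_amnt, installment)

# Shifting and rescaling one variable leaves r unchanged.
r_rescaled = pearson_r([2 * x + 5 for x in loan_amnt], installment)

print(f"r = {r:.4f}, after rescaling = {r_rescaled:.4f}")
```

Ten rows are far too few to reproduce the full-dataset coefficient; the point is only to show the formula and its invariance to location and scale.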
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation.
To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
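The rank-based calculation can be sketched as follows: convert each variable to ranks, then apply the Pearson formula to the ranks. Assigning tied values the average of their rank positions is an assumption of this sketch (it is the usual convention), and all names are illustrative.

```python
import math

def average_ranks(values):
    """Rank values from 1 to n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the block while the next sorted value is tied with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks

def pearson_r(xs, ys):
    """Covariance over the product of standard deviations."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(xs, ys):
    """Spearman's rho: Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))

# A monotonic but nonlinear relationship: rho is 1 although r is below 1.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print(f"rho = {spearman_rho(x, y):.4f}, r = {pearson_r(x, y):.4f}")
```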
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation.
To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
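The pair-counting definition can be sketched directly. The version below follows the formula exactly as worded above (often called tau-a), where tied pairs count as neither concordant nor discordant. Libraries such as SciPy default to a tie-adjusted variant (tau-b), so results can differ when the data contain ties. The function name is illustrative.

```python
from itertools import combinations

def kendall_tau(xs, ys):
    """Kendall's tau-a: (concordant - discordant) / total number of pairs."""
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:
            concordant += 1   # both variables move in the same direction
        elif product < 0:
            discordant += 1   # the variables move in opposite directions
        # product == 0 means a tie in x or y; it contributes to neither count
    total_pairs = len(xs) * (len(xs) - 1) // 2
    return (concordant - discordant) / total_pairs

# Three observations: two concordant pairs, one discordant pair.
print(kendall_tau([1, 2, 3], [1, 3, 2]))
```

This brute-force loop is quadratic in the number of observations, so it is suitable only for small samples.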
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available online.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
First rows
| | loan_amnt | term | int_rate | installment | grade | sub_grade | emp_title | emp_length | home_ownership | annual_inc | verification_status | issue_d | loan_status | purpose | title | dti | earliest_cr_line | open_acc | pub_rec | revol_bal | revol_util | total_acc | initial_list_status | application_type | mort_acc | pub_rec_bankruptcies | address |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 10000.0 | 36 months | 11.44 | 329.48 | B | B4 | Marketing | 10+ years | RENT | 117000.0 | Not Verified | Jan-2015 | 0 | vacation | Vacation | 26.24 | Jun-1990 | 16.0 | 0.0 | 36369.0 | 41.8 | 25.0 | w | INDIVIDUAL | 0.0 | 0.0 | 0174 Michelle Gateway\r\nMendozaberg, OK 22690 |
| 1 | 8000.0 | 36 months | 11.99 | 265.68 | B | B5 | Credit analyst | 4 years | MORTGAGE | 65000.0 | Not Verified | Jan-2015 | 0 | debt_consolidation | Debt consolidation | 22.05 | Jul-2004 | 17.0 | 0.0 | 20131.0 | 53.3 | 27.0 | f | INDIVIDUAL | 3.0 | 0.0 | 1076 Carney Fort Apt. 347\r\nLoganmouth, SD 05113 |
| 2 | 15600.0 | 36 months | 10.49 | 506.97 | B | B3 | Statistician | < 1 year | RENT | 43057.0 | Source Verified | Jan-2015 | 0 | credit_card | Credit card refinancing | 12.79 | Aug-2007 | 13.0 | 0.0 | 11987.0 | 92.2 | 26.0 | f | INDIVIDUAL | 0.0 | 0.0 | 87025 Mark Dale Apt. 269\r\nNew Sabrina, WV 05113 |
| 3 | 7200.0 | 36 months | 6.49 | 220.65 | A | A2 | Client Advocate | 6 years | RENT | 54000.0 | Not Verified | Nov-2014 | 0 | credit_card | Credit card refinancing | 2.60 | Sep-2006 | 6.0 | 0.0 | 5472.0 | 21.5 | 13.0 | f | INDIVIDUAL | 0.0 | 0.0 | 823 Reid Ford\r\nDelacruzside, MA 00813 |
| 4 | 24375.0 | 60 months | 17.27 | 609.33 | C | C5 | Destiny Management Inc. | 9 years | MORTGAGE | 55000.0 | Verified | Apr-2013 | 1 | credit_card | Credit Card Refinance | 33.95 | Mar-1999 | 13.0 | 0.0 | 24584.0 | 69.8 | 43.0 | f | INDIVIDUAL | 1.0 | 0.0 | 679 Luna Roads\r\nGreggshire, VA 11650 |
| 5 | 20000.0 | 36 months | 13.33 | 677.07 | C | C3 | HR Specialist | 10+ years | MORTGAGE | 86788.0 | Verified | Sep-2015 | 0 | debt_consolidation | Debt consolidation | 16.31 | Jan-2005 | 8.0 | 0.0 | 25757.0 | 100.6 | 23.0 | f | INDIVIDUAL | 4.0 | 0.0 | 1726 Cooper Passage Suite 129\r\nNorth Deniseberg, DE 30723 |
| 6 | 18000.0 | 36 months | 5.32 | 542.07 | A | A1 | Software Development Engineer | 2 years | MORTGAGE | 125000.0 | Source Verified | Sep-2015 | 0 | home_improvement | Home improvement | 1.36 | Aug-2005 | 8.0 | 0.0 | 4178.0 | 4.9 | 25.0 | f | INDIVIDUAL | 3.0 | 0.0 | 1008 Erika Vista Suite 748\r\nEast Stephanie, TX 22690 |
| 7 | 13000.0 | 36 months | 11.14 | 426.47 | B | B2 | Office Depot | 10+ years | RENT | 46000.0 | Not Verified | Sep-2012 | 0 | credit_card | No More Credit Cards | 26.87 | Sep-1994 | 11.0 | 0.0 | 13425.0 | 64.5 | 15.0 | f | INDIVIDUAL | 0.0 | 0.0 | USCGC Nunez\r\nFPO AE 30723 |
| 8 | 18900.0 | 60 months | 10.99 | 410.84 | B | B3 | Application Architect | 10+ years | RENT | 103000.0 | Verified | Oct-2014 | 0 | debt_consolidation | Debt consolidation | 12.52 | Jun-1994 | 13.0 | 0.0 | 18637.0 | 32.9 | 40.0 | w | INDIVIDUAL | 3.0 | 0.0 | USCGC Tran\r\nFPO AP 22690 |
| 9 | 26300.0 | 36 months | 16.29 | 928.40 | C | C5 | Regado Biosciences | 3 years | MORTGAGE | 115000.0 | Verified | Apr-2012 | 0 | debt_consolidation | Debt Consolidation | 23.69 | Dec-1997 | 13.0 | 0.0 | 22171.0 | 82.4 | 37.0 | f | INDIVIDUAL | 1.0 | 0.0 | 3390 Luis Rue\r\nMauricestad, VA 00813 |
Last rows
| | loan_amnt | term | int_rate | installment | grade | sub_grade | emp_title | emp_length | home_ownership | annual_inc | verification_status | issue_d | loan_status | purpose | title | dti | earliest_cr_line | open_acc | pub_rec | revol_bal | revol_util | total_acc | initial_list_status | application_type | mort_acc | pub_rec_bankruptcies | address |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 396020 | 10000.0 | 36 months | 9.76 | 321.55 | B | B3 | Retirement Counselor | 10+ years | RENT | 40000.0 | Not Verified | Dec-2015 | 0 | debt_consolidation | Debt consolidation | 23.40 | Jan-1988 | 9.0 | 0.0 | 8819.0 | 57.3 | 18.0 | w | INDIVIDUAL | 1.0 | 0.0 | 914 Alexander Mountains Apt. 604\r\nEast Marco, VT 70466 |
| 396021 | 3200.0 | 36 months | 5.42 | 96.52 | A | A1 | St Francis Medical Center | 10+ years | RENT | 33000.0 | Not Verified | Feb-2011 | 0 | debt_consolidation | 2011 Insurance and Debt Consolidation | 21.45 | Nov-1996 | 18.0 | 0.0 | 3985.0 | 7.6 | 50.0 | f | INDIVIDUAL | NaN | 0.0 | 309 John Mission\r\nWest Marc, NY 00813 |
| 396022 | 12000.0 | 36 months | 12.29 | 400.24 | C | C1 | Data Center Specialist II | 1 year | RENT | 52100.0 | Source Verified | Oct-2015 | 0 | debt_consolidation | Debt consolidation | 17.28 | Oct-2004 | 6.0 | 0.0 | 9580.0 | 66.1 | 18.0 | w | INDIVIDUAL | 0.0 | 0.0 | 532 Johnson Drive Apt. 185\r\nAndersonside, NY 70466 |
| 396023 | 22000.0 | 36 months | 18.92 | 805.55 | D | D4 | Operations Manager | 10+ years | MORTGAGE | 138000.0 | Not Verified | Apr-2014 | 0 | debt_consolidation | Debt consolidation | 24.43 | May-1998 | 18.0 | 0.0 | 22287.0 | 50.4 | 39.0 | f | INDIVIDUAL | 4.0 | 0.0 | 0297 Flores Dale Suite 441\r\nTaylorland, MD 05113 |
| 396024 | 6000.0 | 36 months | 13.11 | 202.49 | B | B4 | Michael's Arts & Crafts | 5 years | RENT | 64000.0 | Not Verified | Mar-2013 | 0 | debt_consolidation | Credit buster | 10.81 | Nov-1991 | 7.0 | 0.0 | 11456.0 | 97.1 | 9.0 | w | INDIVIDUAL | 0.0 | 0.0 | 514 Cynthia Park Apt. 402\r\nWest Williamside, SC 05113 |
| 396025 | 10000.0 | 60 months | 10.99 | 217.38 | B | B4 | licensed bankere | 2 years | RENT | 40000.0 | Source Verified | Oct-2015 | 0 | debt_consolidation | Debt consolidation | 15.63 | Nov-2004 | 6.0 | 0.0 | 1990.0 | 34.3 | 23.0 | w | INDIVIDUAL | 0.0 | 0.0 | 12951 Williams Crossing\r\nJohnnyville, DC 30723 |
| 396026 | 21000.0 | 36 months | 12.29 | 700.42 | C | C1 | Agent | 5 years | MORTGAGE | 110000.0 | Source Verified | Feb-2015 | 0 | debt_consolidation | Debt consolidation | 21.45 | Feb-2006 | 6.0 | 0.0 | 43263.0 | 95.7 | 8.0 | f | INDIVIDUAL | 1.0 | 0.0 | 0114 Fowler Field Suite 028\r\nRachelborough, LA 05113 |
| 396027 | 5000.0 | 36 months | 9.99 | 161.32 | B | B1 | City Carrier | 10+ years | RENT | 56500.0 | Verified | Oct-2013 | 0 | debt_consolidation | pay off credit cards | 17.56 | Mar-1997 | 15.0 | 0.0 | 32704.0 | 66.9 | 23.0 | f | INDIVIDUAL | 0.0 | 0.0 | 953 Matthew Points Suite 414\r\nReedfort, NY 70466 |
| 396028 | 21000.0 | 60 months | 15.31 | 503.02 | C | C2 | Gracon Services, Inc | 10+ years | MORTGAGE | 64000.0 | Verified | Aug-2012 | 0 | debt_consolidation | Loanforpayoff | 15.88 | Nov-1990 | 9.0 | 0.0 | 15704.0 | 53.8 | 20.0 | f | INDIVIDUAL | 5.0 | 0.0 | 7843 Blake Freeway Apt. 229\r\nNew Michael, FL 29597 |
| 396029 | 2000.0 | 36 months | 13.61 | 67.98 | C | C2 | Internal Revenue Service | 10+ years | RENT | 42996.0 | Verified | Jun-2010 | 0 | debt_consolidation | Toxic Debt Payoff | 8.32 | Sep-1998 | 3.0 | 0.0 | 4292.0 | 91.3 | 19.0 | f | INDIVIDUAL | NaN | 0.0 | 787 Michelle Causeway\r\nBriannaton, AR 48052 |